junyoung-00/Phi-3.5-vision-instruct-ChartCap

text-generation

chart-captioning

vision-language-model

junyoung-00 committed on Sep 23

Commit 937db36 · verified · 1 Parent(s): a4dafaf

Update README.md

Files changed (1)

README.md +20 -7

@@ -17,10 +17,23 @@ This repository contains the model presented in the paper [**ChartCap: Mitigatin
 ## Model Description
-`Phi-3.5-vision-instruct-ChartCap` is a ChartCap-fine-tuned version of microsoft/Phi-3.5-vision-instruct.
+`Phi-3.5-vision-instruct-ChartCap` is a ChartCap-fine-tuned version of  [microsoft/Phi-3.5-vision-instruct](https://huggingface.co/microsoft/Phi-3.5-vision-instruct).
 The model aims to generate high-quality, dense captions for charts, ensuring that the generated text accurately captures structural elements and key insights discernible from the charts, while mitigating the inclusion of extraneous or hallucinated information.
+## Required Packages
+```bash
+flash_attn==2.5.8
+numpy==1.24.4
+Pillow==10.3.0
+Requests==2.31.0
+torch==2.3.0
+torchvision==0.18.0
+transformers==4.43.0
+accelerate==0.30.0
+```
 ## How to Use
 ```python
@@ -65,10 +78,10 @@ print(response.strip())
 If you find this model or the associated research helpful, please cite:
 ```bibtex
-@inproceedings{{lim2025chartcap,
-  title={{ChartCap: Mitigating Hallucination of Dense Chart Captioning}},
-  author={{Junyoung Lim and Jaewoo Ahn and Gunhee Kim}},
-  booktitle={{Proceedings of the IEEE/CVF International Conference on Computer Vision}},
-  year={{2025}}
-}}
+@inproceedings{lim2025chartcap,
+  title = {ChartCap: Mitigating Hallucination of Dense Chart Captioning},
+  author = {Junyoung Lim and Jaewoo Ahn and Gunhee Kim},
+  booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision},
+  year = {2025}
+}
 ```
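
The version pins this commit adds under "Required Packages" can be checked against a local environment before running the model. The following is a minimal sketch, not part of the README: the helper names `parse_pins`, `installed_version`, and `find_mismatches` are hypothetical, and it assumes each pinned name resolves as a distribution name through the standard-library `importlib.metadata`.

```python
from importlib import metadata

# Pins copied from the "Required Packages" list added in this commit.
PINS = """\
flash_attn==2.5.8
numpy==1.24.4
Pillow==10.3.0
Requests==2.31.0
torch==2.3.0
torchvision==0.18.0
transformers==4.43.0
accelerate==0.30.0
"""


def parse_pins(text):
    """Turn 'name==version' lines into a {name: version} dict."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, version = line.partition("==")
        pins[name.strip()] = version.strip()
    return pins


def installed_version(name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """List (name, wanted, found) for each package that is missing or differs."""
    mismatches = []
    for name, wanted in pins.items():
        found = lookup(name)
        # Drop a local build suffix such as "+cu121" before comparing.
        base = found.split("+", 1)[0] if found else None
        if base != wanted:
            mismatches.append((name, wanted, found))
    return mismatches


if __name__ == "__main__":
    for name, wanted, found in find_mismatches(parse_pins(PINS)):
        print(f"{name}: expected {wanted}, found {found or 'not installed'}")
```

Run as a script, this prints one line per package whose installed version differs from the pin and prints nothing when everything matches. The comparison is an exact string match after removing any local build suffix, so treat a reported mismatch as a prompt to inspect the environment rather than as proof of incompatibility.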