Update README.md
README.md
CHANGED
@@ -13,6 +13,36 @@ pipeline_tag: document-question-answering

Idefics2 8B fine-tuned on 800+ multi-page documents for Visual DocQA. Make sure you have the latest peft and transformers before loading the model. GPU is required for it to work properly.

## Model Details

### Model Description
@@ -137,14 +167,14 @@ TODO

#### Testing Data

-

#### Metrics

-

### Results

-

#### Summary

Idefics2 8B fine-tuned on 800+ multi-page documents for Visual DocQA. Make sure you have the latest peft and transformers before loading the model. GPU is required for it to work properly.

+Compared to the base model, this model has a lower edit distance on the test set (a 53% improvement on micro average).
+
+|    | Category                    |   Idefics2-8B |   Idefics2-8B-EDGAR | Δ(↑)   |
+|---:|:----------------------------|--------------:|--------------------:|:-------|
+|  0 | agreement_date              |      0.878489 |           0.0999479 | 88.62% |
+|  1 | agreement_term              |      0.907067 |           0.438816  | 51.62% |
+|  2 | auto_renewal                |      0.634946 |           0.0516129 | 91.87% |
+|  3 | contract_value              |      0.474438 |           0.418815  | 11.72% |
+|  4 | counterparty_address        |      0.771387 |           0.59835   | 22.43% |
+|  5 | counterparty_name           |      0.825491 |           0.633359  | 23.27% |
+|  6 | counterparty_signer_name    |      0.842091 |           0.480444  | 42.95% |
+|  7 | counterparty_signer_title   |      0.61746  |           0.496041  | 19.66% |
+|  8 | effective_date              |      0.903268 |           0.125641  | 86.09% |
+|  9 | expiration_date             |      0.88673  |           0.235197  | 73.48% |
+| 10 | governing_law               |      0.881037 |           0.308771  | 64.95% |
+| 11 | opt_out_length              |      0.431548 |           0.047619  | 88.97% |
+| 12 | party_address               |      0.730897 |           0.608301  | 16.77% |
+| 13 | party_name                  |      0.726411 |           0.490194  | 32.52% |
+| 14 | payment_frequency           |      0.686123 |           0.373724  | 45.53% |
+| 15 | payment_term                |      0.854552 |           0.593333  | 30.57% |
+| 16 | renewal_term                |      0.92829  |           0.0595238 | 93.59% |
+| 17 | termination_for_cause       |      0.436    |           0.048     | 88.99% |
+| 18 | termination_for_convenience |      0.628261 |           0.156522  | 75.09% |
+| 19 | termination_notice_period   |      0.329748 |           0.178394  | 45.90% |
+| 20 | venue                       |      0.781417 |           0.61403   | 21.42% |
+
+
+
+[Image: image/png]
+
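The Δ(↑) column is not defined in the text. Its values are consistent with a relative reduction in edit distance, (base - fine-tuned) / base, so the sketch below assumes that formula. The function name `relative_improvement` is illustrative, and the three rows are copied from the table above.

```python
def relative_improvement(base: float, finetuned: float) -> float:
    """Percentage reduction in edit distance relative to the base model.

    Assumed formula (inferred from the table, not stated in the README):
    (base - finetuned) / base * 100
    """
    if base == 0:
        raise ValueError("base edit distance must be non-zero")
    return (base - finetuned) / base * 100


# Three rows copied from the table: (category, Idefics2-8B, Idefics2-8B-EDGAR)
rows = [
    ("agreement_date", 0.878489, 0.0999479),
    ("contract_value", 0.474438, 0.418815),
    ("renewal_term", 0.92829, 0.0595238),
]

# Recompute the delta column, rounded to two decimals like the table.
deltas = {name: round(relative_improvement(b, f), 2) for name, b, f in rows}

for name, delta in deltas.items():
    print(f"{name}: {delta:.2f}%")
```

Under this assumption the recomputed values agree with the reported column for the sampled rows. Lower edit distance is better, so a larger Δ means a larger gain from fine-tuning.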
## Model Details

### Model Description

#### Testing Data

+20% of the whole dataset.

#### Metrics

+Edit Distance (nltk).

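The Metrics entry names edit distance from nltk but does not say how scores were scaled. Since the reported values fall between 0 and 1, this sketch assumes the raw distance is divided by the length of the longer string. That normalization, the pure-Python fallback, and the helper name `normalized_edit_distance` are assumptions, not the author's evaluation code.

```python
try:
    # nltk's edit_distance computes the Levenshtein distance between two strings.
    from nltk.metrics.distance import edit_distance
except ImportError:
    # Fallback so the sketch runs without nltk: dynamic-programming Levenshtein
    # with unit cost for insertion, deletion, and substitution.
    def edit_distance(s1: str, s2: str) -> int:
        prev = list(range(len(s2) + 1))
        for i, c1 in enumerate(s1, start=1):
            curr = [i]
            for j, c2 in enumerate(s2, start=1):
                cost = 0 if c1 == c2 else 1
                curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
            prev = curr
        return prev[-1]


def normalized_edit_distance(prediction: str, reference: str) -> float:
    """Edit distance scaled to [0, 1]; dividing by the longer length is an assumption."""
    longest = max(len(prediction), len(reference))
    if longest == 0:
        return 0.0  # two empty strings are identical
    return edit_distance(prediction, reference) / longest


# Hypothetical model output compared against a hypothetical ground-truth field.
score = normalized_edit_distance("kitten", "sitting")
print(f"normalized edit distance: {score:.4f}")
```

A score of 0 means the prediction matches the reference exactly, and a score of 1 means no characters align.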

### Results

+See the per-category table above.

#### Summary