Update README.md
README.md
@@ -132,7 +132,7 @@ for i, prob in enumerate(probabilities):
 ### Training Data
 The model was trained on a novel, heterogeneous corpus of 23,969 multi-labeled documents from 11 diverse sources, including government, academia, industry, and civil society, with some sources translated from Japanese. This approach was designed to address the "interpretive diversity" of SDG-related language.
 
-For full details on reconstructing the training corpus, please refer to **Supplementary Information
+For full details on reconstructing the training corpus, please refer to **Supplementary Information S4** in our paper.
 
 ### Evaluation
 This model was selected based on its superior generalization performance (especially recall) on external datasets like the OSDG Community Dataset and the SDGi Corpus. On a human-coded sample of scientific articles, the model achieved a macro-averaged **F1-score of 0.623**. For a full breakdown of performance metrics, please see the paper.
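
For readers unfamiliar with the metric quoted in the Evaluation section, here is a minimal sketch of how a macro-averaged F1-score can be computed for multi-label predictions. It is an illustration only, not the evaluation code from the paper: the function name `macro_f1`, the toy labels, and the choice to score a label with no positives as 0.0 are all assumptions made for this example.

```python
def macro_f1(y_true: list[set[int]], y_pred: list[set[int]], labels: list[int]) -> float:
    """Unweighted mean of per-label F1 scores (hypothetical helper for illustration)."""
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if label in t and label in p)
        fp = sum(1 for t, p in pairs if label not in t and label in p)
        fn = sum(1 for t, p in pairs if label in t and label not in p)
        denom = 2 * tp + fp + fn
        # Assumption: a label with no true and no predicted positives scores 0.0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy data: three documents, each tagged with a set of SDG label ids.
gold = [{1, 2}, {2}, {3}]
pred = [{1}, {2, 3}, {3}]
score = macro_f1(gold, pred, labels=[1, 2, 3])
print(f"macro F1 = {score:.3f}")
```

Because every label contributes equally to the mean, rare SDGs weigh as much as frequent ones, which is why macro averaging is a common choice when label frequencies are imbalanced.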