Update README.md
Browse files
README.md
CHANGED
@@ -26,6 +26,15 @@ Training was conducted on the [LUMI supercomputer](https://www.lumi-supercompute
|
|
26 |
The project aimed to train multilingual encoder models that support long context and all official Finnish languages¹. The model can theoretically extrapolate to a context length of 128,000 tokens.
|
27 |
|
28 |
¹Multiple Sámi languages are spoken in Finland, but Northern Sámi is the most widespread and thus included in the training data. English is not the official language of Finland, but it is widely used. Latin was included for potential clinical use.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
29 |
## Model Overview
|
30 |
| Hyperparameter | Value |
|
31 |
| :------------- | :----: |
|
|
|
26 |
The project aimed to train multilingual encoder models that support long context and all official Finnish languages¹. The model can theoretically extrapolate to a context length of 128,000 tokens.
|
27 |
|
28 |
¹Multiple Sámi languages are spoken in Finland, but Northern Sámi is the most widespread and thus included in the training data. English is not the official language of Finland, but it is widely used. Latin was included for potential clinical use.
|
29 |
+
## Table of Contents
|
30 |
+
1. [Model Overview](#model-overview)
|
31 |
+
2. [Training](#training)
|
32 |
+
3. [Training data](#training-data)
|
33 |
+
4. [Evaluation results](#evaluation-results)
|
34 |
+
5. [Ethical Considerations and Limitations](#ethical-considerations-and-limitations)
|
35 |
+
6. [Aknowledgements](#aknowledgements)
|
36 |
+
7. [Licence](#licence)
|
37 |
+
8. [Citation information](#citation-information)
|
38 |
## Model Overview
|
39 |
| Hyperparameter | Value |
|
40 |
| :------------- | :----: |
|