TurkuNLP
/

finnish-modernbert-large

Model card Files Files and versions

akseli-reunamo commited on Jun 25

Commit

25c3f6b

·

verified ·

1 Parent(s): b1a7ad2

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -26,6 +26,15 @@ Training was conducted on the [LUMI supercomputer](https://www.lumi-supercompute
 The project aimed to train multilingual encoder models that support long context and all official Finnish languages¹. The model can theoretically extrapolate to a context length of 128,000 tokens.
 ¹Multiple Sámi languages are spoken in Finland, but Northern Sámi is the most widespread and thus included in the training data. English is not the official language of Finland, but it is widely used. Latin was included for potential clinical use.
 ## Model Overview
 | Hyperparameter | Value  |
 | :------------- | :----: |

 The project aimed to train multilingual encoder models that support long context and all official Finnish languages¹. The model can theoretically extrapolate to a context length of 128,000 tokens.
 ¹Multiple Sámi languages are spoken in Finland, but Northern Sámi is the most widespread and thus included in the training data. English is not the official language of Finland, but it is widely used. Latin was included for potential clinical use.
+## Table of Contents
+1. [Model Overview](#model-overview)
+2. [Training](#training)
+3. [Training data](#training-data)
+4. [Evaluation results](#evaluation-results)
+5. [Ethical Considerations and Limitations](#ethical-considerations-and-limitations)
+6. [Aknowledgements](#aknowledgements)
+7. [Licence](#licence)
+8. [Citation information](#citation-information)
 ## Model Overview
 | Hyperparameter | Value  |
 | :------------- | :----: |