Fixed BLIMP few-shot setting
Browse files
README.md
CHANGED
@@ -67,7 +67,7 @@ To evaluate our model, we created Basque versions of well-established English be
|
|
67 |
|
68 |
Additionally, we evaluated our model using a suite of already publicly available Basque Benchmarks:
|
69 |
|
70 |
-
- [**BL2MP**](https://huggingface.co/datasets/orai-nlp/bl2mp) (Urbizu et al., 2024) [
|
71 |
|
72 |
- [**Belebele**](https://huggingface.co/datasets/facebook/belebele) (Bandarkar et al.) [5-shot]: Belebele is a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants.
|
73 |
|
|
|
67 |
|
68 |
Additionally, we evaluated our model using a suite of already publicly available Basque Benchmarks:
|
69 |
|
70 |
+
- [**BL2MP**](https://huggingface.co/datasets/orai-nlp/bl2mp) (Urbizu et al., 2024) [0-shot]: The BL2MP test set, designed to assess the grammatical knowledge of language Models in the Basque language, inspired by the BLiMP benchmark.
|
71 |
|
72 |
- [**Belebele**](https://huggingface.co/datasets/facebook/belebele) (Bandarkar et al.) [5-shot]: Belebele is a multiple-choice machine reading comprehension (MRC) dataset spanning 122 language variants.
|
73 |
|