These datasets are used to evaluate models on French performance using: https://github.com/EleutherAI/lm-evaluation-harness (from CroissantLLM paper)
Manuel Faysse
manu
AI & ML interests
NLP, Privacy, multi-modal DL
Recent Activity
upvoted an article 2 days ago
BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders new activity 11 days ago
manu/bge-fr-en:Size of dataset liked a model about 1 month ago
athrael-soju/colqwen3.5-4.5B-v1