BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper • 2502.07346 • Published 13 days ago • 49
multilingual_domain_datasets Collection Multilingual datasets. Excluding those which are just a cleaned version of CC. • 3 items • Updated 7 days ago
multilingual_domain_datasets Collection Multilingual datasets. Excluding those which are just a cleaned version of CC. • 3 items • Updated 7 days ago
multilingual_benchmark Collection For evaluating multilingual ability of LLMs • 1 item • Updated 11 days ago
Corpus: Evaluation datasets for ES & LATAM Collection Corpus of La Leaderboard, the open LLM leaderboard for ES & LATAM • 56 items • Updated 19 days ago • 4