benchmark haonan-li/cmmlu Viewer • Updated Jul 13, 2023 • 11.9k • 7.29k • 72 nlp-waseda/JMMLU Updated Feb 27, 2024 • 327 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 13.9k • 81 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 93.1k • 331
benchmark haonan-li/cmmlu Viewer • Updated Jul 13, 2023 • 11.9k • 7.29k • 72 nlp-waseda/JMMLU Updated Feb 27, 2024 • 327 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 13.9k • 81 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 93.1k • 331