282
GAIA Leaderboard
π¦Ύ
Submit models for evaluation and view leaderboard scores
Submit models for evaluation and view leaderboard scores
Compare model answers to questions
Track, rank and evaluate open LLMs and chatbots
Explore and filter language model benchmark results
Run a Streamlit web app
Evaluate language models automatically
Display and explore model leaderboards and chat history