Running on CPU Upgrade 282 282 GAIA Leaderboard 🦾 Submit models for evaluation and view leaderboard scores
Running on CPU Upgrade 63 63 LeaderboardExplorer 🔎 Filter and display leaderboards based on selected criteria
Running on CPU Upgrade 12.6k 12.6k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
Running 90 90 Nexus Function Calling Leaderboard 🐠 Visualize model performance on function calling tasks
Running on CPU Upgrade 4.87k 4.87k MTEB Leaderboard 🥇 Select benchmarks and languages for text embeddings evaluation