A set of tools for fine-tuning, evaluation, prototyping, agentic workflows, and related tasks. ATTENTION: always duplicate these Spaces on our infrastructure!

Causaly LTD
Enterprise
company
AI & ML interests
None defined yet.
Collections
2
Most commonly used leaderboards to check model capabilities
- Open LLM Leaderboard (12.6k likes): 🏆 Track, rank and evaluate open LLMs and chatbots
- LLM Performance Leaderboard (283 likes): 🐨 View LLM Performance Leaderboard
- Chatbot Arena Leaderboard (4.07k likes): 🏆 Display chatbot leaderboard statistics
- MTEB Leaderboard (4.87k likes): 🥇 Select benchmarks and languages for text embeddings evaluation
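The note at the top of this page asks that these Spaces always be duplicated before use. Below is a minimal sketch of one way to do that with the `huggingface_hub` client, assuming that "our infra" means a namespace you control on the Hub. The helper names `target_space_id` and `duplicate_to_namespace`, and the IDs `some-owner/some-space` and `your-org`, are hypothetical placeholders, not values taken from this page.

```python
def target_space_id(source_id: str, target_namespace: str) -> str:
    """Keep the Space name and swap the owner for the target namespace."""
    owner, sep, name = source_id.partition("/")
    if not sep or not owner or not name:
        raise ValueError(f"expected 'owner/name', got {source_id!r}")
    return f"{target_namespace}/{name}"


def duplicate_to_namespace(source_id: str, target_namespace: str) -> str:
    """Duplicate a Space into target_namespace as a private copy (assumed policy)."""
    # Imported lazily so target_space_id works without huggingface_hub installed.
    from huggingface_hub import duplicate_space

    to_id = target_space_id(source_id, target_namespace)
    # Requires a token with write access to target_namespace.
    # exist_ok=True avoids an error if the copy already exists.
    duplicate_space(from_id=source_id, to_id=to_id, private=True, exist_ok=True)
    return to_id


if __name__ == "__main__":
    # Placeholder IDs: replace with a real Space and your own namespace.
    print(target_space_id("some-owner/some-space", "your-org"))
```

Calling `duplicate_to_namespace("some-owner/some-space", "your-org")` would then create the copy. Whether it should be private, and which hardware it runs on, depends on your own setup.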
Models
None public yet
Datasets
None public yet