LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38 User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 20
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 20
Leaderboards Running 513 513 Image Arena Leaderboard 📊 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 6.25k 6.25k MTEB Leaderboard 🥇 Embedding Leaderboard Running on CPU Upgrade 13.4k 13.4k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots Running 4.59k 4.59k LMArena Leaderboard 🏆 Display LMArena Leaderboard
Running on CPU Upgrade 13.4k 13.4k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
LLM TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38 User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 20
TravelPlanner: A Benchmark for Real-World Planning with Language Agents Paper • 2402.01622 • Published Feb 2, 2024 • 38
User-LLM: Efficient LLM Contextualization with User Embeddings Paper • 2402.13598 • Published Feb 21, 2024 • 20
Leaderboards Running 513 513 Image Arena Leaderboard 📊 Image Generation and Image Editing Arena & Leaderboard Running on CPU Upgrade 6.25k 6.25k MTEB Leaderboard 🥇 Embedding Leaderboard Running on CPU Upgrade 13.4k 13.4k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots Running 4.59k 4.59k LMArena Leaderboard 🏆 Display LMArena Leaderboard
Running on CPU Upgrade 13.4k 13.4k Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots