This is excellent @ngxson .. no errors and very crisp.
Pratik Bhavsar PRO
pratikbhavsar
AI & ML interests
LLM agents, evaluation & reasoning
Recent Activity
commented on
their
article
6 days ago
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
commented on
their
article
6 days ago
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
updated
a Space
9 days ago
galileo-ai/agent-leaderboard
Organizations
pratikbhavsar's activity

commented on
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
6 days ago

commented on
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
6 days ago
Thank you Erin! We will continue to update this further with more LLMs :)

published
an
article
11 days ago
Article
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios
By
and 1 other
•
•
13
upvoted
an
article
24 days ago
Article
Open-R1: a fully open reproduction of DeepSeek-R1
•
770
bespokelabs/Bespoke-Stratos-17k
Viewer
•
Updated
•
16.7k
•
94.4k
•
278
open-thoughts/OpenThoughts-114k
Viewer
•
Updated
•
228k
•
103k
•
588
PrimeIntellect/NuminaMath-QwQ-CoT-5M
Viewer
•
Updated
•
5.14M
•
3.21k
•
48
ServiceNow-AI/R1-Distill-SFT
Viewer
•
Updated
•
1.85M
•
7.3k
•
255
cognitivecomputations/dolphin-r1
Viewer
•
Updated
•
814k
•
5.75k
•
263

upvoted
a
collection
24 days ago
Does this have tooling support?
4
#7 opened about 1 month ago
by
xceptor


upvoted
a
collection
6 months ago