INSAIT-Institute/BrokenMath
Viewer
•
Updated
•
15.4k
•
66
•
1
The first benchmark for evaluating LLM sycophancy in mathematical reasoning.
Totally Free + Zero Barriers + No Login Required