The Ultimate Collection of Code Classifiers Collection π₯ 15 classifiers, 124M parameters, one per programming languageβ for assessing the educational value of GitHub code β’ 15 items β’ Updated 3 days ago β’ 9
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper β’ 2502.11089 β’ Published 7 days ago β’ 133
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper β’ 2502.12115 β’ Published 6 days ago β’ 41
view article Article What is test-time compute and how to scale it? By Kseniase and 1 other β’ 17 days ago β’ 39
Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2 Paper β’ 2502.03544 β’ Published 18 days ago β’ 42
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper β’ 2502.07316 β’ Published 12 days ago β’ 44
Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance Paper β’ 2502.08127 β’ Published 11 days ago β’ 49
Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges Paper β’ 2502.08680 β’ Published 11 days ago β’ 11
SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models Paper β’ 2502.09390 β’ Published 10 days ago β’ 16
CoT-Valve: Length-Compressible Chain-of-Thought Tuning Paper β’ 2502.09601 β’ Published 10 days ago β’ 12
Logical Reasoning in Large Language Models: A Survey Paper β’ 2502.09100 β’ Published 10 days ago β’ 21
Can this Model Also Recognize Dogs? Zero-Shot Model Search from Weights Paper β’ 2502.09619 β’ Published 10 days ago β’ 31
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper β’ 2502.08910 β’ Published 10 days ago β’ 139
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper β’ 2502.09056 β’ Published 10 days ago β’ 30
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper β’ 2502.06703 β’ Published 13 days ago β’ 133
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? Paper β’ 2502.00674 β’ Published 21 days ago β’ 12