rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 257
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains Paper • 2501.05707 • Published Jan 10 • 20
VideoRAG: Retrieval-Augmented Generation over Video Corpus Paper • 2501.05874 • Published Jan 10 • 67
Newborn Article Impact Predict Space • Use title and abstract to predict future academic impact • 20
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published Nov 5, 2024 • 66
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 90
MALT: Improving Reasoning with Multi-Agent LLM Training Paper • 2412.01928 • Published Dec 2, 2024 • 42
Critical Tokens Matter: Token-Level Contrastive Estimation Enhances LLM's Reasoning Capability Paper • 2411.19943 • Published Nov 29, 2024 • 58
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments Paper • 2410.23918 • Published Oct 31, 2024 • 20
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders Paper • 2410.22366 • Published Oct 28, 2024 • 78
Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning Paper • 2410.22304 • Published Oct 29, 2024 • 17