view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 5 days ago β’ 53
Soundwave: Less is More for Speech-Text Alignment in LLMs Paper β’ 2502.12900 β’ Published 5 days ago β’ 72
IHEval: Evaluating Language Models on Following the Instruction Hierarchy Paper β’ 2502.08745 β’ Published 11 days ago β’ 18
ReLearn: Unlearning via Learning for Large Language Models Paper β’ 2502.11190 β’ Published 7 days ago β’ 28
How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training Paper β’ 2502.11196 β’ Published 7 days ago β’ 20
Logical Reasoning in Large Language Models: A Survey Paper β’ 2502.09100 β’ Published 11 days ago β’ 21
An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging Paper β’ 2502.09056 β’ Published 11 days ago β’ 30
SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Paper β’ 2502.09604 β’ Published 10 days ago β’ 31
Skrr: Skip and Re-use Text Encoder Layers for Memory Efficient Text-to-Image Generation Paper β’ 2502.08690 β’ Published 11 days ago β’ 39
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper β’ 2502.08910 β’ Published 11 days ago β’ 139
π§ Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community β’ 12 items β’ Updated 4 days ago β’ 79
SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators Paper β’ 2502.06394 β’ Published 13 days ago β’ 85
Expect the Unexpected: FailSafe Long Context QA for Finance Paper β’ 2502.06329 β’ Published 13 days ago β’ 122
BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Paper β’ 2502.07346 β’ Published 13 days ago β’ 49
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper β’ 2502.06703 β’ Published 13 days ago β’ 134
view article Article Fine-tune Deepseek-R1 with a Synthetic Reasoning Dataset By sdiazlor β’ 13 days ago β’ 35
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper β’ 2502.06781 β’ Published 13 days ago β’ 59
Detecting AI-Generated Sentences in Human-AI Collaborative Hybrid Texts: Challenges, Strategies, and Insights Paper β’ 2403.03506 β’ Published Mar 6, 2024 β’ 1
QuEST: Stable Training of LLMs with 1-Bit Weights and Activations Paper β’ 2502.05003 β’ Published 16 days ago β’ 41
Goku: Flow Based Video Generative Foundation Models Paper β’ 2502.04896 β’ Published 16 days ago β’ 88