Beyond Memorization: Extending Reasoning Depth with Recurrence, Memory and Test-Time Compute Scaling Paper • 2508.16745 • Published 17 days ago • 27
Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks Paper • 2508.18672 • Published 13 days ago • 9
ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models Paper • 2508.18773 • Published 13 days ago • 14
UltraMemV2: Memory Networks Scaling to 120B Parameters with Superior Long-Context Learning Paper • 2508.18756 • Published 13 days ago • 36
DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis Paper • 2508.20033 • Published 12 days ago • 7
AudioStory: Generating Long-Form Narrative Audio with Large Language Models Paper • 2508.20088 • Published 12 days ago • 20
Predicting the Order of Upcoming Tokens Improves Language Modeling Paper • 2508.19228 • Published 13 days ago • 21
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers Paper • 2508.20453 • Published 11 days ago • 57
PaperRegister: Boosting Flexible-grained Paper Search via Hierarchical Register Indexing Paper • 2508.11116 • Published 24 days ago • 22
RLPR: Extrapolating RLVR to General Domains without Verifiers Paper • 2506.18254 • Published Jun 23 • 32
Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models Paper • 2501.05767 • Published Jan 10 • 30
LLMtimesMapReduce: Simplified Long-Sequence Processing using Large Language Models Paper • 2410.09342 • Published Oct 12, 2024 • 40