LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model Paper • 2509.00676 • Published 8 days ago • 74
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published 17 days ago • 83
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling Paper • 2508.17445 • Published 14 days ago • 78
ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT Text Generation • 15B • Updated 13 days ago • 55 • 1
ReasoningTransferability/UniReason-Qwen3-14B-think-SFT Text Generation • 15B • Updated 13 days ago • 52
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving Paper • 2507.06229 • Published Jul 8 • 73
ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT Text Generation • 15B • Updated 13 days ago • 55 • 1
ReasoningTransferability/UniReason-Qwen3-14B-think-SFT Text Generation • 15B • Updated 13 days ago • 52
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1 • 76