No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes Paper • 2508.19060 • Published 12 days ago • 8
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables Paper • 2508.19813 • Published 11 days ago • 20
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning Paper • 2508.21104 • Published 10 days ago • 28
UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat Paper • 2508.17378 • Published 14 days ago • 6
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench Paper • 2508.20931 • Published 10 days ago • 15
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents Paper • 2508.17198 • Published 14 days ago • 6
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published 17 days ago • 83
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published 17 days ago • 83 • 4
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published 17 days ago • 83 • 4
Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published 17 days ago • 83