Submitted by Sylvestre 40 Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion · 6 authors 1
Submitted by philschmid 38 Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models · 6 authors 1
Submitted by zuom 16 Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages · 5 authors 1