DVD: Deterministic Video Depth Estimation with Generative Priors Paper • 2603.12250 • Published 3 days ago • 18
Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence Paper • 2603.07660 • Published 7 days ago • 77
LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory Paper • 2603.03269 • Published 12 days ago • 53
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model Paper • 2603.05438 • Published 10 days ago • 35
NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing Paper • 2603.02802 • Published 12 days ago • 7
Causal Motion Diffusion Models for Autoregressive Motion Generation Paper • 2602.22594 • Published 17 days ago • 7
Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling Paper • 2602.21760 • Published 18 days ago • 13
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published 19 days ago • 30
On Data Engineering for Scaling LLM Terminal Capabilities Paper • 2602.21193 • Published 19 days ago • 94
tttLRM: Test-Time Training for Long Context and Autoregressive 3D Reconstruction Paper • 2602.20160 • Published 20 days ago • 10
Mobile-O: Unified Multimodal Understanding and Generation on Mobile Device Paper • 2602.20161 • Published 20 days ago • 23
Stroke3D: Lifting 2D strokes into rigged 3D model via latent diffusion models Paper • 2602.09713 • Published Feb 10 • 8
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published Feb 6 • 35
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30 • 217
RISE-Video: Can Video Generators Decode Implicit World Rules? Paper • 2602.05986 • Published Feb 5 • 26
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis Paper • 2602.03139 • Published Feb 3 • 42
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper • 2602.03796 • Published Feb 3 • 62