HERO: Hierarchical Extrapolation and Refresh for Efficient World Models Paper • 2508.17588 • Published 14 days ago • 2
M3Ret: Unleashing Zero-shot Multimodal Medical Image Retrieval via Self-Supervision Paper • 2509.01360 • Published 6 days ago • 11
Mimicking the Physicist's Eye: A VLM-centric Approach for Physics Formula Discovery Paper • 2508.17380 • Published 14 days ago • 5
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper • 2508.21148 • Published 10 days ago • 131
AudioStory: Generating Long-Form Narrative Audio with Large Language Models Paper • 2508.20088 • Published 11 days ago • 20
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time Autoregressive Video Generation Paper • 2508.19320 • Published 12 days ago • 27
Personalized Safety Alignment for Text-to-Image Diffusion Models Paper • 2508.01151 • Published Aug 2 • 8
X-Omni: Reinforcement Learning Makes Discrete Autoregressive Image Generative Models Great Again Paper • 2507.22058 • Published Jul 29 • 38