Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published 2 days ago • 18
Transition Models: Rethinking the Generative Learning Objective Paper • 2509.04394 • Published 2 days ago • 18 • 3
InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions Paper • 2506.09984 • Published Jun 11 • 15
Unleashing Vecset Diffusion Model for Fast Shape Generation Paper • 2503.16302 • Published Mar 20 • 44
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines Paper • 2410.21220 • Published Oct 28, 2024 • 10
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines Paper • 2410.21220 • Published Oct 28, 2024 • 10 • 2