Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion Paper • 2506.08009 • Published Jun 9 • 28
InterActHuman: Multi-Concept Human Animation with Layout-Aligned Audio Conditions Paper • 2506.09984 • Published Jun 11 • 15
Hunyuan-Game: Industrial-grade Intelligent Game Creation Model Paper • 2505.14135 • Published May 20 • 15
FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios Paper • 2505.03730 • Published May 6 • 28
InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity Paper • 2503.16418 • Published Mar 20 • 36
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams Paper • 2406.08085 • Published Jun 12, 2024 • 17
Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model Paper • 2408.00754 • Published Aug 1, 2024 • 25