Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models • Paper • 2510.05034 • Published Oct 6, 2025 • 48
LongLive: Real-time Interactive Long Video Generation • Paper • 2509.22622 • Published Sep 26, 2025 • 184
A Survey on Video Temporal Grounding with Multimodal Large Language Model • Paper • 2508.10922 • Published Aug 7, 2025 • 1
Qwen2.5-VL • Collection • Vision-language model series based on Qwen2.5 • 11 items • Updated 2 days ago • 549