Composing Concepts from Images and Videos via Concept-prompt Binding • Paper • 2512.09824 • Published 17 days ago • 27
Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds • Paper • 2511.08892 • Published Nov 12 • 201
OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs • Paper • 2510.10689 • Published Oct 12 • 46
TC-Light: Temporally Consistent Relighting for Dynamic Long Videos • Paper • 2506.18904 • Published Jun 23 • 10