MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions Paper • 2112.00431 • Published Dec 1, 2021
OpenTAD: A Unified Framework and Comprehensive Study of Temporal Action Detection Paper • 2502.20361 • Published Feb 27 • 1
MatchDiffusion: Training-free Generation of Match-cuts Paper • 2411.18677 • Published Nov 27, 2024 • 1
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think Paper • 2504.20708 • Published Apr 29 • 23