Taming generative video models for zero-shot optical flow extraction Paper • 2507.09082 • Published Jul 11, 2025
CaptionQA: Is Your Caption as Useful as the Image Itself? Paper • 2511.21025 • Published Nov 26, 2025
IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos Paper • 2411.11409 • Published Nov 18, 2024