Submitted by jymcc 56 CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis · 7 authors 4
Submitted by akhaliq 30 MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence · 7 authors 2
Submitted by akhaliq 27 OutfitAnyone: Ultra-high Quality Virtual Try-On for Any Clothing and Any Person · 11 authors 5
Submitted by Kaiyue 26 T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation · 7 authors 4
Submitted by thuhsy 14 F-HOI: Toward Fine-grained Semantic-Aligned 3D Human-Object Interactions · 5 authors 3
Submitted by akhaliq 13 INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model · 7 authors 3
Submitted by akhaliq 12 A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data · 7 authors 2
Submitted by akhaliq 6 Cross Anything: General Quadruped Robot Navigation through Complex Terrains · 5 authors 2