H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos Paper • 2512.09406 • Published 21 days ago • 3
H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos Paper • 2512.09406 • Published 21 days ago • 3
H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos Paper • 2512.09406 • Published 21 days ago • 3
DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection Paper • 2511.19111 • Published Nov 24 • 3
DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection Paper • 2511.19111 • Published Nov 24 • 3 • 2
view article Article Metric and Relative Monocular Depth Estimation: An Overview. Fine-Tuning Depth Anything V2 👐 📚 Jul 10, 2024 • 90
ROICtrl: Boosting Instance Control for Visual Generation Paper • 2411.17949 • Published Nov 27, 2024 • 87