ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks Paper • 2508.08240 • Published 27 days ago • 43
Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization Paper • 2508.14811 • Published 18 days ago • 39
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models Paper • 2403.11627 • Published Mar 18, 2024
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition Paper • 2405.13870 • Published May 22, 2024
LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis Paper • 2412.15214 • Published Dec 19, 2024 • 15
FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior Paper • 2407.04947 • Published Jul 6, 2024
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration Paper • 2505.20256 • Published May 26 • 17
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models Paper • 2508.09138 • Published 26 days ago • 36
Test-Time Reinforcement Learning for GUI Grounding via Region Consistency Paper • 2508.05615 • Published Aug 7 • 21
Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation Paper • 2507.22886 • Published Jul 30 • 9
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization Paper • 2507.15758 • Published Jul 21 • 34