OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity Paper • 2409.19987 • Published Sep 30, 2024
LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment Paper • 2403.13307 • Published Mar 20, 2024
Epona: Autoregressive Diffusion World Model for Autonomous Driving Paper • 2506.24113 • Published Jun 30 • 1
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization Paper • 2507.07997 • Published Jul 10 • 1
Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering Paper • 2309.09724 • Published Sep 18, 2023
PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion Paper • 2312.09069 • Published Dec 14, 2023
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models Paper • 2308.05733 • Published Aug 10, 2023
Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth Paper • 2202.01470 • Published Feb 3, 2022
GIM: Learning Generalizable Image Matcher From Internet Videos Paper • 2402.11095 • Published Feb 16, 2024 • 3
GaussianPro: 3D Gaussian Splatting with Progressive Propagation Paper • 2402.14650 • Published Feb 22, 2024 • 8
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image Paper • 2403.12013 • Published Mar 18, 2024
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction Paper • 2409.18124 • Published Sep 26, 2024 • 34
Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving Paper • 2410.22313 • Published Oct 29, 2024
Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration Paper • 2411.17240 • Published Nov 26, 2024
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving Paper • 2503.05689 • Published Mar 7 • 3
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image Paper • 2307.10984 • Published Jul 20, 2023 • 2
Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation Paper • 2404.15506 • Published Mar 22, 2024
DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model Paper • 2410.10429 • Published Oct 14, 2024
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT Paper • 2412.19505 • Published Dec 27, 2024 • 1