3 3

Wei Yin

WonderingWorld

https://yvanyin.xyz/

YvanYin

AI & ML interests

CV, DL

Recent Activity

liked a dataset about 1 month ago

mkjia/UHDBench

authored a paper about 1 month ago

OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity

authored a paper about 1 month ago

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

View all activity

Organizations

None yet

authored 4 papers about 1 month ago

OccRWKV: Rethinking Efficient 3D Semantic Occupancy Prediction with Linear Complexity

Paper • 2409.19987 • Published Sep 30, 2024

LaserHuman: Language-guided Scene-aware Human Motion Generation in Free Environment

Paper • 2403.13307 • Published Mar 20, 2024

Epona: Autoregressive Diffusion World Model for Autonomous Driving

Paper • 2506.24113 • Published Jun 30 • 1

MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization

Paper • 2507.07997 • Published Jul 10 • 1

authored 16 papers 6 months ago

Towards 3D Scene Reconstruction from Locally Scale-Aligned Monocular Video Depth

Paper • 2202.01470 • Published Feb 3, 2022

GIM: Learning Generalizable Image Matcher From Internet Videos

Paper • 2402.11095 • Published Feb 16, 2024 • 3

GaussianPro: 3D Gaussian Splatting with Progressive Propagation

Paper • 2402.14650 • Published Feb 22, 2024 • 8

GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Paper • 2403.12013 • Published Mar 18, 2024

Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

Paper • 2409.18124 • Published Sep 26, 2024 • 34

Depth Any Video with Scalable Synthetic Data

Paper • 2410.10815 • Published Oct 14, 2024 • 2

Senna: Bridging Large Vision-Language Models and End-to-End Autonomous Driving

Paper • 2410.22313 • Published Oct 29, 2024

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

Paper • 2411.17240 • Published Nov 26, 2024

GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving

Paper • 2503.05689 • Published Mar 7 • 3

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image

Paper • 2307.10984 • Published Jul 20, 2023 • 2

Metric3D v2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation

Paper • 2404.15506 • Published Mar 22, 2024

DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model

Paper • 2410.10429 • Published Oct 14, 2024

DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT

Paper • 2412.19505 • Published Dec 27, 2024 • 1

Wei Yin

AI & ML interests

Recent Activity

Organizations

WonderingWorld's activity