EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation Paper • 2411.08380 • Published Nov 13, 2024
OPERA: Omni-Supervised Representation Learning with Hierarchical Supervisions Paper • 2210.05557 • Published Oct 11, 2022
OccFormer: Dual-path Transformer for Vision-based 3D Semantic Occupancy Prediction Paper • 2304.05316 • Published Apr 11, 2023
DREAM: Efficient Dataset Distillation by Representative Matching Paper • 2302.14416 • Published Feb 28, 2023
OpenOccupancy: A Large Scale Benchmark for Surrounding Semantic Occupancy Perception Paper • 2303.03991 • Published Mar 7, 2023
SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving Paper • 2303.09551 • Published Mar 16, 2023
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving Paper • 2309.09777 • Published Sep 18, 2023
On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving Paper • 2311.05332 • Published Nov 9, 2023