Avada11 nielsr HF Staff commited on
Commit
914b3f9
·
verified ·
1 Parent(s): 5aedf1d

Improve model card: Add depth-estimation pipeline tag, update license, and enhance content (#1)

Browse files

- Improve model card: Add depth-estimation pipeline tag, update license, and enhance content (2be846df561d77f1a3905c6d974c01f4b0e989b9)


Co-authored-by: Niels Rogge <[email protected]>

Files changed (1) hide show
  1. README.md +46 -4
README.md CHANGED
@@ -1,9 +1,51 @@
1
  ---
2
- license: cc-by-nc-4.0
 
3
  ---
4
 
5
- This repository contains the camera depth model of the paper Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots.
6
 
7
- Model inference guide: https://github.com/ByteDance-Seed/manip-as-in-sim-suite/tree/main/cdm
 
8
 
9
- Project page: https://manipulation-as-in-simulation.github.io
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: apache-2.0
3
+ pipeline_tag: depth-estimation
4
  ---
5
 
6
+ # Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots
7
 
8
+ This repository contains the Camera Depth Models (CDMs) from the paper:
9
+ [Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots](https://huggingface.co/papers/2509.02530).
10
 
11
+ ## Overview
12
+
13
+ Camera Depth Models (CDMs) are proposed as a simple plugin for daily-use depth cameras. They take RGB images and raw depth signals as input and output denoised, accurate metric depth. This enables policies trained purely in simulation to transfer directly to real robots by providing nearly simulation-level accurate depth perception.
14
+
15
+ ## Links
16
+
17
+ * **Project Page**: [https://manipulation-as-in-simulation.github.io](https://manipulation-as-in-simulation.github.io/)
18
+ * **Code Repository**: [https://github.com/ByteDance-Seed/manip-as-in-sim-suite](https://github.com/ByteDance-Seed/manip-as-in-sim-suite)
19
+
20
+ ## Usage
21
+
22
+ For detailed installation instructions and further usage examples, please refer to the [CDM documentation in the GitHub repository](https://github.com/ByteDance-Seed/manip-as-in-sim-suite/tree/main/cdm).
23
+
24
+ ### CDM Inference Example
25
+
26
+ To run depth inference on RGB-D camera data, use the following command:
27
+
28
+ ```bash
29
+ cd cdm
30
+ python infer.py \
31
+ --encoder vitl \
32
+ --model-path /path/to/model.pth \
33
+ --rgb-image /path/to/rgb.jpg \
34
+ --depth-image /path/to/depth.png \
35
+ --output result.png
36
+ ```
37
+
38
+ ## Citation
39
+
40
+ If you use this work in your research, please cite:
41
+
42
+ ```bibtex
43
+ @article{liu2025manipulation,
44
+ title={Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots},
45
+ author={Liu, Minghuan and Zhu, Zhengbang and Han, Xiaoshen and Hu, Peng and Lin, Haotong and
46
+ Li, Xinyao and Chen, Jingxiao and Xu, Jiafeng and Yang, Yichu and Lin, Yunfeng and
47
+ Li, Xinghang and Yu, Yong and Zhang, Weinan and Kong, Tao and Kang, Bingyi},
48
+ journal={arXiv preprint},
49
+ year={2025}
50
+ }
51
+ ```