villa-X: A Vision-Language-Latent-Action Model

arXivProjectCode

How to use

Check out https://github.com/microsoft/villa-x/

Downloads last month

-

Downloads are not tracked for this model. How to track
Video Preview
loading