Image-Text-to-Text
Transformers
Safetensors
Cosmos
English
qwen2_5_vl
image-to-text
nvidia
conversational
text-generation-inference
zekunhao's picture
08/01 release: Added support for spatial-temporal reasoning of city and industrial operations
0caf724