Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
yali30
/
findingdory-qwen2.5-VL-3B-finetuned
like
3
Image-Text-to-Text
Transformers
Safetensors
yali30/findingdory
English
qwen2_5_vl
image-to-text
habitat
embodied-ai
memory
conversational
text-generation-inference
arxiv:
2506.15635
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
main
findingdory-qwen2.5-VL-3B-finetuned
/
README.md
Commit History
Update README.md (
#2
)
0ec4169
verified
yali30
ykarmesh
commited on
Jul 7
Create README.md (
#1
)
dc5c9a3
verified
yali30
ykarmesh
commited on
Jul 1