Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jason500
's Collections
siweilian
text_gen_img
duomotai&tuxiangbianji
grounding
Quant
video_preprocess
mutilmodal_video2text
mutil big modal image2text
caption
MMLM
caption
updated
Nov 7, 2024
Upvote
-
zai-org/cogvlm2-llama3-caption
Video-Text-to-Text
•
13B
•
Updated
May 14
•
814
•
107
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Feb 6
•
9.97k
•
•
306
Upvote
-
Share collection
View history
Collection guide
Browse collections