jason500's Collections
MMLM
Updated Mar 24
allenai/Molmo-7B-O-0924 • Image-Text-to-Text • 8B • Updated Jun 19 • 15.7k • 161
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • 8B • Updated Apr 4 • 21.8k • 545
zai-org/cogvlm2-llama3-caption • Video-Text-to-Text • 13B • Updated May 14 • 814 • 107
mistralai/Pixtral-12B-Base-2409 • Updated Jul 28 • 17 • 104
mistralai/Pixtral-12B-2409 • Updated Jul 28 • 1.77k • 661
zai-org/glm-4v-9b • 14B • Updated Mar 3 • 569k • 264
OpenGVLab/InternVL-Chat-V1-2-SFT-Data • Viewer • Updated Sep 20, 2024 • 573k • 634 • 24
weic22/InstructSeg • 3B • Updated Dec 19, 2024 • 7 • 3
mistralai/Mistral-Small-3.1-24B-Instruct-2503 • 24B • Updated Jul 28 • 294k • 1.31k