Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
jason500
's Collections
siweilian
text_gen_img
duomotai&tuxiangbianji
grounding
Quant
video_preprocess
mutilmodal_video2text
mutil big modal image2text
caption
MMLM
mutilmodal_video2text
updated
Dec 26, 2024
Upvote
-
OpenGVLab/InternVideo2-Chat-8B
Video-Text-to-Text
•
8B
•
Updated
Oct 10, 2024
•
376
•
23
alibaba-pai/VideoCLIP-XL
Updated
Oct 7, 2024
•
21
zai-org/cogvlm2-llama3-caption
Video-Text-to-Text
•
13B
•
Updated
May 14
•
814
•
107
OpenGVLab/InternVideo2-Stage2_1B-224p-f4
Updated
Apr 14, 2024
•
15
Runtime error
86
86
LongVU
🌖
Generate responses to video or image inputs
Upvote
-
Share collection
View history
Collection guide
Browse collections