alan
jason500
text_gen_img
grounding
video_preprocess
mutil big modal image2text
- OpenGVLab/InternVL-14B-224px
  Image Feature Extraction • 14B • 165 • 35
- openbmb/MiniCPM-V-2_6
  Image-Text-to-Text • 8B • 111k • 1k
- RhapsodyAI/MiniCPM-V-Embedding-preview
  Feature Extraction • 99 • 52
- meta-llama/Llama-3.2-11B-Vision-Instruct
  Image-Text-to-Text • 11B • 453k • 1.51k
MMLM
siweilian
text_gen_img
duomotai&tuxiangbianji
grounding
Quant
video_preprocess
mutilmodal_video2text
mutil big modal image2text
caption
MMLM