Hugging Face
Models: 6,686
Sort: Trending
Active filters: image-text-to-text
Kemy44/Qwen2-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Feb 10 • 4
prithivMLmods/Open-R1-Mini-Experimental • Image-Text-to-Text • 2B • Updated Feb 12 • 8 • 4
prithivMLmods/Open-R1-Mini-Experimental-GGUF • Image-Text-to-Text • 2B • Updated Feb 12 • 356 • 5
X-iZhang/libra-llava-med-v1.5-mistral-7b • Image-Text-to-Text • 8B • Updated Aug 15 • 5.61k • 1
AIDC-AI/Ovis2-4B • Image-Text-to-Text • 5B • Updated Aug 15 • 172k • 61
ordis-co-ltd/Qwen2.5-VL-72B-Instruct_exl2_6.0bpw • Image-Text-to-Text • Updated Feb 10 • 4 • 2
drmcbride/UI-TARS-2B-SFT-Q8_0-GGUF • Image-Text-to-Text • 2B • Updated Apr 14 • 7
HuggingFaceTB/SmolVLM2-256M-Video-Instruct • Image-Text-to-Text • 0.3B • Updated Apr 8 • 653k • 78
Fancy-MLLM/R1-Onevision-7B • Image-Text-to-Text • 8B • Updated Feb 25 • 1.58k • 44
SVECTOR-CORPORATION/Spec-Vision-V1 • Image-Text-to-Text • 4B • Updated Feb 11 • 6 • 4
zyoNoob/Qwen2.5-VL-7B-Instruct-AWQ • Image-Text-to-Text • 3B • Updated Feb 11 • 6
lxasqjc/lavender-llama-3.2-11b-lora • Image-Text-to-Text • Updated Feb 17 • 2
zyoNoob/Qwen2.5-VL-3B-Instruct-AWQ • Image-Text-to-Text • 1B • Updated Feb 11 • 4
matrixportalx/Qwen2-VL-2B-Instruct-GGUF • Image-Text-to-Text • 2B • Updated Feb 11 • 13
mehmetkeremturkcan/FemtoVLM-Femto • Image-Text-to-Text • Updated Feb 18
darthhexx/Qwen2.5-VL-3B-Instruct-FP8-Dynamic • Image-Text-to-Text • 4B • Updated Feb 11 • 3
mlx-community/ui-tars-7b-dpo • Image-Text-to-Text • 1B • Updated Feb 11 • 12 • 1
AsmaaMahmoudSaeddd/Florence-2-Arabic-OCR • Image-Text-to-Text • 0.3B • Updated Feb 12 • 6
isaiahbjork/Qwen2-VL-2B-Instruct-Q4_K_M-GGUF • Image-Text-to-Text • 2B • Updated Feb 12 • 23
StevenHH2000/Finedefics • Image-Text-to-Text • 8B • Updated Feb 12 • 6 • 6
prithivMLmods/Hoags-2B-Exp • Image-Text-to-Text • 2B • Updated Feb 15 • 3 • 3
mlx-community/SmolVLM2-2.2B-Instruct-mlx • Image-Text-to-Text • 2B • Updated Feb 20 • 125 • 7
lkg67/minicpm4 • Image-Text-to-Text • 5B • Updated Feb 13 • 4 • 1
Qwen/Qwen2.5-VL-3B-Instruct-AWQ • Image-Text-to-Text • 1B • Updated Apr 6 • 334k • 57
Qwen/Qwen2.5-VL-72B-Instruct-AWQ • Image-Text-to-Text • 13B • Updated Mar 7 • 69.4k • 64
OpenGVLab/Mono-InternVL-2B-S1-1 • Image-Text-to-Text • 3B • Updated Jul 22 • 16
Benasd/Qwen2.5-VL-72B-Instruct-AWQ • Image-Text-to-Text • 13B • Updated Feb 20 • 23.8k • 6
OpenGVLab/Mono-InternVL-2B-S1-3 • Image-Text-to-Text • 3B • Updated Jul 22 • 14 • 1
kanashi6/UFO • Image-Text-to-Text • Updated Jun 27 • 2
Nikhil-aka-Nick/FlorenceDropout2 • Image-Text-to-Text • 0.8B • Updated Feb 14 • 3