Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Nebius AI
Cerebras
Novita
Together AI
Fireworks
Featherless AI
fal
Groq
+ 8
Apply filters
Models
6,596
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
numind/NuExtract-2.0-8B
Image-Text-to-Text
•
8B
•
Updated
Aug 9
•
3.87k
•
35
unsloth/Qwen2.5-VL-3B-Instruct-GGUF
Image-Text-to-Text
•
3B
•
Updated
May 12
•
7.86k
•
12
unsloth/Qwen2.5-VL-32B-Instruct-GGUF
Image-Text-to-Text
•
33B
•
Updated
May 12
•
12.7k
•
5
ByteDance/Dolphin
Image-Text-to-Text
•
0.4B
•
Updated
Jul 16
•
52.5k
•
478
Hcompany/Holo1-7B
Image-Text-to-Text
•
8B
•
Updated
Jun 10
•
2k
•
222
unsloth/medgemma-27b-text-it-GGUF
Image-Text-to-Text
•
27B
•
Updated
May 20
•
6.18k
•
51
mlabonne/gemma-3-12b-it-abliterated-v2-GGUF
Image-Text-to-Text
•
12B
•
Updated
May 29
•
6.22k
•
30
lmstudio-community/medgemma-4b-it-MLX-4bit
Image-Text-to-Text
•
0.9B
•
Updated
May 29
•
1.47k
•
2
XiaomiMiMo/MiMo-VL-7B-RL
Image-Text-to-Text
•
8B
•
Updated
Jun 7
•
19.8k
•
160
microsoft/GUI-Actor-7B-Qwen2.5-VL
Image-Text-to-Text
•
8B
•
Updated
Aug 9
•
1.33k
•
23
lingshu-medical-mllm/Lingshu-32B
Image-Text-to-Text
•
33B
•
Updated
6 days ago
•
6.35k
•
59
echo840/MonkeyOCR
Image-Text-to-Text
•
Updated
26 days ago
•
606
•
510
lingshu-medical-mllm/Lingshu-7B
Image-Text-to-Text
•
8B
•
Updated
6 days ago
•
8.32k
•
58
mlx-community/Nanonets-OCR-s-bf16
Image-Text-to-Text
•
4B
•
Updated
Jun 18
•
106
•
2
moonshotai/Kimi-VL-A3B-Thinking-2506
Image-Text-to-Text
•
16B
•
Updated
Aug 18
•
16.5k
•
307
unsloth/Mistral-Small-3.2-24B-Instruct-2506-FP8
Image-Text-to-Text
•
Updated
Jun 21
•
74
•
6
Vchitect/ShotVL-7B
Image-Text-to-Text
•
8B
•
Updated
4 days ago
•
1.31k
•
14
unsloth/gemma-3n-E4B-it-GGUF
Image-Text-to-Text
•
7B
•
Updated
Jun 30
•
46.8k
•
162
unsloth/gemma-3n-E4B-it-unsloth-bnb-4bit
Image-Text-to-Text
•
8B
•
Updated
Jul 11
•
32.5k
•
9
amine-khelif/MaVistral-GGUF
Image-Text-to-Text
•
24B
•
Updated
Jul 7
•
87
•
5
zai-org/GLM-4.1V-9B-Base
Image-Text-to-Text
•
10B
•
Updated
26 days ago
•
5.18k
•
54
baidu/ERNIE-4.5-VL-424B-A47B-Base-Paddle
Image-Text-to-Text
•
424B
•
Updated
Aug 19
•
1.07k
•
60
echo840/MonkeyOCR-pro-3B
Image-Text-to-Text
•
Updated
26 days ago
•
558
•
3
echo840/MonkeyOCR-pro-1.2B
Image-Text-to-Text
•
Updated
26 days ago
•
534
•
15
openbmb/MiniCPM-V-4
Image-Text-to-Text
•
4B
•
Updated
8 days ago
•
37.6k
•
461
openbmb/MiniCPM-V-4-int4
Image-Text-to-Text
•
2B
•
Updated
8 days ago
•
718
•
6
nvidia/VideoITG-8B
Image-Text-to-Text
•
8B
•
Updated
Aug 13
•
166
•
7
allenai/olmOCR-7B-0725
Image-Text-to-Text
•
8B
•
Updated
28 days ago
•
8.91k
•
58
CohereLabs/command-a-vision-07-2025
Image-Text-to-Text
•
112B
•
Updated
Aug 2
•
53.2k
•
•
83
drwlf/MedraN-E4B
Image-Text-to-Text
•
8B
•
Updated
Aug 13
•
5
•
1
Previous
1
...
5
6
7
8
9
...
100
Next