Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Nebius AI
Cerebras
Novita
Together AI
Fireworks
fal
Groq
Nscale
+ 8
Apply filters
Models
6,653
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
ctranslate2-4you/InternVL2_5-1B
Image-Text-to-Text
•
0.9B
•
Updated
Feb 28
•
3
ctranslate2-4you/InternVL2_5-4B
Image-Text-to-Text
•
4B
•
Updated
Feb 28
•
3
turningpoint-ai/VisualThinker-R1-Zero
Image-Text-to-Text
•
2B
•
Updated
Apr 15
•
529
•
6
ctranslate2-4you/Molmo-7B-D-0924
Image-Text-to-Text
•
8B
•
Updated
Mar 2
•
3
ljnlonoljpiljm/florence-2-base-ft-region-proposal
Image-Text-to-Text
•
0.3B
•
Updated
Mar 11
•
10
saim1212/qwen2_2b_git2
Image-Text-to-Text
•
2B
•
Updated
Mar 1
•
4
adityaghai07/Qwen2-VL-2B-Instruct-Q4_K_M-GGUF
Image-Text-to-Text
•
2B
•
Updated
Mar 1
•
1
Captaint2004/Qwen2-VL-2B-Instruct-Q4_K_M-GGUF
Image-Text-to-Text
•
2B
•
Updated
Mar 1
•
11
google/gemma-3-27b-pt
Image-Text-to-Text
•
27B
•
Updated
Mar 21
•
27.8k
•
107
google/gemma-3-12b-pt
Image-Text-to-Text
•
12B
•
Updated
Mar 21
•
266k
•
71
CohereLabs/aya-vision-8b
Image-Text-to-Text
•
9B
•
Updated
14 days ago
•
66.5k
•
•
311
assentian1970/mplug3_dsd
Image-Text-to-Text
•
8B
•
Updated
Mar 2
•
2
CohereLabs/aya-vision-32b
Image-Text-to-Text
•
33B
•
Updated
14 days ago
•
103
•
•
216
mlx-community/UI-TARS-7B-SFT-4bit
Image-Text-to-Text
•
2B
•
Updated
Mar 3
•
7
mlx-community/UI-TARS-7B-DPO-4bit
Image-Text-to-Text
•
2B
•
Updated
Mar 3
•
6
mlx-community/UI-TARS-7B-SFT-6bit
Image-Text-to-Text
•
2B
•
Updated
Mar 3
•
5
mlx-community/UI-TARS-7B-DPO-6bit
Image-Text-to-Text
•
2B
•
Updated
Mar 3
•
3
mlx-community/UI-TARS-7B-SFT-8bit
Image-Text-to-Text
•
3B
•
Updated
Mar 3
•
7
mlx-community/UI-TARS-7B-SFT-bf16
Image-Text-to-Text
•
8B
•
Updated
Mar 3
•
6
mlx-community/UI-TARS-7B-DPO-8bit
Image-Text-to-Text
•
3B
•
Updated
Mar 3
•
13
•
1
OpenGVLab/InternVL2_5-Pretrain-Models
Image-Text-to-Text
•
Updated
Mar 25
•
6
mlx-community/UI-TARS-7B-DPO-bf16
Image-Text-to-Text
•
8B
•
Updated
Mar 3
•
7
egeozsoy/MM-OR
Image-Text-to-Text
•
Updated
about 1 month ago
rootonchair/InternVL2_5-4B-AWQ
Image-Text-to-Text
•
1B
•
Updated
Mar 3
•
4
•
2
mlx-community/UI-TARS-72B-SFT-4bit
Image-Text-to-Text
•
12B
•
Updated
Mar 3
•
8
mlx-community/UI-TARS-72B-SFT-6bit
Image-Text-to-Text
•
17B
•
Updated
Mar 3
•
4
mlx-community/UI-TARS-72B-SFT-8bit
Image-Text-to-Text
•
21B
•
Updated
Mar 3
•
6
mlx-community/UI-TARS-72B-SFT-bf16
Image-Text-to-Text
•
73B
•
Updated
Mar 3
•
8
•
1
PommesPeter/prism-qwen25-extra-siglip-224px-0_5b
Image-Text-to-Text
•
Updated
10 days ago
•
29
mradermacher/ToriiGate-v0.4-7B-GGUF
Image-Text-to-Text
•
8B
•
Updated
Jul 31
•
566
•
1
Previous
1
...
81
82
83
84
85
...
100
Next