Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Nebius AI
Cerebras
Novita
Together AI
Fireworks
fal
Groq
Featherless AI
+ 8
Apply filters
Models
6,617
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
zw123/delta_clip_h14_336
Image-Text-to-Text
•
Updated
Apr 9
•
59
zw123/delta_clip_l14_224
Image-Text-to-Text
•
Updated
Apr 9
•
63
zw123/delta_clip_l14_336
Image-Text-to-Text
•
Updated
Apr 9
•
31
egemengulpinar/gemma-3-4b-it-Q2_K-GGUF
Image-Text-to-Text
•
4B
•
Updated
Mar 16
•
19
kabachuha/gemma3-4b-it-abliterated
Image-Text-to-Text
•
4B
•
Updated
Mar 16
•
4
•
1
hustvl/MaTVLM_0_25_Mamba2
Image-Text-to-Text
•
3B
•
Updated
Mar 18
•
6
•
1
burtenshaw/gemma-3-4b-it-capybara-test
Image-Text-to-Text
•
4B
•
Updated
Mar 16
•
7
frontrx/SmolVLM2_ECG
Image-Text-to-Text
•
0.5B
•
Updated
Mar 16
•
4
Elcaida/gemma3_4b_1st
Image-Text-to-Text
•
4B
•
Updated
Mar 16
•
3
IPEC-COMMUNITY/spatialvla-4b-224-sft-fractal
Image-Text-to-Text
•
4B
•
Updated
Mar 24
•
2.4k
odomcl22/gemma-3-4b-it-Q8_0-GGUF
Image-Text-to-Text
•
4B
•
Updated
Mar 16
•
17
Skywork/SkyworkVL-2B
Image-Text-to-Text
•
2B
•
Updated
Jun 13
•
15
•
8
mlabonne/gemma-3-4b-it-abliterated
Image-Text-to-Text
•
4B
•
Updated
Mar 21
•
1.98k
•
24
mlabonne/gemma-3-12b-it-abliterated
Image-Text-to-Text
•
12B
•
Updated
Mar 21
•
1.2k
•
22
mlabonne/gemma-3-27b-it-abliterated
Image-Text-to-Text
•
27B
•
Updated
Mar 21
•
11.4k
•
•
201
Ba2han/Gemini-3-27B-Think-0.2
Image-Text-to-Text
•
27B
•
Updated
Mar 16
•
2
austinfujimori/molmo-endpoint
Image-Text-to-Text
•
Updated
Mar 17
nmcco/gemma-3-4b-pt-with-speaker-tokens
Image-Text-to-Text
•
4B
•
Updated
Mar 16
•
2
zw123/delta2_llava_4_v1_5_7b
Image-Text-to-Text
•
Updated
Apr 9
•
3
zw123/delta2_llava_8_v1_5_7b
Image-Text-to-Text
•
Updated
Apr 9
•
3
msyukorai/gemma-3-4b-it-Q4_0-GGUF
Image-Text-to-Text
•
4B
•
Updated
Mar 17
•
18
FriendliAI/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Mar 17
•
4
•
1
FriendliAI/Qwen2-VL-7B-Instruct
Image-Text-to-Text
•
8B
•
Updated
Mar 17
•
118
FriendliAI/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Mar 17
•
2
•
1
tonybucket/gemma-3-4b-it-Q5_K_M-GGUF
Image-Text-to-Text
•
4B
•
Updated
Mar 17
•
7
Skywork/Skywork-R1V-38B
Image-Text-to-Text
•
38B
•
Updated
Aug 12
•
69.8k
•
127
lhquangminh/gemma-3-4b-it-Q4_0-GGUF
Image-Text-to-Text
•
4B
•
Updated
Mar 17
•
2
aimagelab/HySAC
Image-Text-to-Text
•
Updated
Mar 21
•
1
abhishekchohan/gemma-3-12b-it-quantized-W4A16
Image-Text-to-Text
•
3B
•
Updated
Mar 17
•
4.13k
•
5
abhishekchohan/gemma-3-27b-it-quantized-W4A16
Image-Text-to-Text
•
5B
•
Updated
Mar 17
•
1.06k
•
4
Previous
1
...
89
90
91
92
93
...
100
Next