Hugging Face
Models
6,683
Sort: Trending
Active filters:
image-text-to-text
Aitrepreneur/Florence-2-base • Image-Text-to-Text • Updated Feb 1 • 4
Aitrepreneur/Florence-2-large • Image-Text-to-Text • Updated Feb 1 • 3
Triangle104/LatexMind-2B-Codec-Q4_K_S-GGUF • Image-Text-to-Text • 2B • Updated Feb 2 • 4
Triangle104/LatexMind-2B-Codec-Q4_K_M-GGUF • Image-Text-to-Text • 2B • Updated Feb 2 • 12
Triangle104/LatexMind-2B-Codec-Q5_K_S-GGUF • Image-Text-to-Text • 2B • Updated Feb 2 • 7
Triangle104/LatexMind-2B-Codec-Q5_K_M-GGUF • Image-Text-to-Text • 2B • Updated Feb 2 • 4
Triangle104/LatexMind-2B-Codec-Q6_K-GGUF • Image-Text-to-Text • 2B • Updated Feb 2 • 5
Triangle104/LatexMind-2B-Codec-Q8_0-GGUF • Image-Text-to-Text • 2B • Updated Feb 2 • 7
pauljmorris/UI-TARS-7B-DPO • Image-Text-to-Text • Updated Feb 2
krutrim-ai-labs/Chitrarth • Image-Text-to-Text • 8B • Updated Mar 26 • 181 • 15
mukulp/Qwen2.5-VL-72B-Instruct-bf16 • Image-Text-to-Text • 73B • Updated Feb 2 • 21
mehmetkeremturkcan/DeepSeek-LLaVA-Instruct • Image-Text-to-Text • Updated Feb 2 • 1
pedalnomica/InternVL2_5-78B-MPO-AWQ • Image-Text-to-Text • 18B • Updated Feb 2 • 6
ilpa-user/mp3_nc_vision • Image-Text-to-Text • Updated Feb 3 • 3
billatsectorflow/Qwen2-VL-7B-Instruct-GPTQ-Int4 • Image-Text-to-Text • 3B • Updated Feb 3 • 3
moot20/SmolVLM-500M-Instruct-MLX-4bits • Image-Text-to-Text • 0.1B • Updated Feb 19 • 3
moot20/SmolVLM-500M-Instruct-MLX-6bits • Image-Text-to-Text • 0.1B • Updated Feb 19 • 8
moot20/SmolVLM-500M-Instruct-MLX-8bits • Image-Text-to-Text • 0.1B • Updated Feb 19 • 8
moot20/SmolVLM-256M-Instruct-MLX-4bits • Image-Text-to-Text • 0.0B • Updated Feb 19 • 10
moot20/SmolVLM-256M-Instruct-MLX-6bits • Image-Text-to-Text • 0.1B • Updated Feb 19 • 5
moot20/SmolVLM-256M-Instruct-MLX-8bits • Image-Text-to-Text • 0.1B • Updated Feb 19 • 7
moot20/SmolVLM-256M-Instruct-MLX • Image-Text-to-Text • 0.3B • Updated Feb 19 • 17
moot20/SmolVLM-500M-Instruct-MLX • Image-Text-to-Text • 0.5B • Updated Feb 19 • 6
google/paligemma2-10b-mix-448-jax • Image-Text-to-Text • Updated Feb 7 • 3
InfiX-ai/InfiGUIAgent-2B-Stage1 • Image-Text-to-Text • 2B • Updated Feb 6 • 10 • 3
google/paligemma2-3b-mix-224-jax • Image-Text-to-Text • Updated Feb 7 • 2 • 1
nm-testing/Pixtral-Large-Instruct-2411-hf • Image-Text-to-Text • 124B • Updated Feb 6 • 130
iamraafay/deepseek-vl-1.3b-4bitill-qwen-1.5b • Image-Text-to-Text • 0.3B • Updated Feb 4 • 21
ThatEvan/Qwen2-VL-7B-Instruct-Q8_0-GGUF • Image-Text-to-Text • 8B • Updated Feb 4
HuanjinYao/Mulberry_qwen2vl_7b • Image-Text-to-Text • 8B • Updated Feb 4 • 79 • 2