Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Nebius AI
Cerebras
Novita
Together AI
Fireworks
fal
Groq
Featherless AI
+ 8
Apply filters
Models
6,690
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
prithivMLmods/Omni-Reasoner-2B
Image-Text-to-Text
•
2B
•
Updated
May 3
•
4
•
4
jp1924/KoLLaVa9b-patch14-384-llava-inst
Image-Text-to-Text
•
10B
•
Updated
Jan 17
URSA-MATH/URSA-8B
Image-Text-to-Text
•
8B
•
Updated
Mar 10
•
48
Isotr0py/deepseek-vl2-tiny
Image-Text-to-Text
•
3B
•
Updated
Jan 17
•
26
URSA-MATH/URSA-RM-8B
Image-Text-to-Text
•
8B
•
Updated
Feb 18
•
11
gongting/Qwen2-VL-2B-Instruct
Image-Text-to-Text
•
2B
•
Updated
Jan 18
•
3
ewre324/moondream2
Image-Text-to-Text
•
2B
•
Updated
Jan 18
•
90
HuanjinYao/Mulberry_llama_11b
Image-Text-to-Text
•
11B
•
Updated
Jan 19
•
4
prithivMLmods/Radiology-Infer-Mini
Image-Text-to-Text
•
2B
•
Updated
Jul 9
•
3.53k
•
13
paramedik/Qwen2-VL-7B-Instruct-Q8_0-GGUF
Image-Text-to-Text
•
8B
•
Updated
Jan 18
•
1
Minthy/ToriiGate-v0.4-2B
Image-Text-to-Text
•
2B
•
Updated
Jan 19
•
1.05k
•
11
0xchuks/Moxxie
Image-Text-to-Text
•
Updated
Jan 19
•
1
ByteDance-Seed/UI-TARS-2B-SFT
Image-Text-to-Text
•
2B
•
Updated
Jan 25
•
11.2k
•
26
ByteDance-Seed/UI-TARS-7B-SFT
Image-Text-to-Text
•
8B
•
Updated
Jan 25
•
9.48k
•
178
ByteDance-Seed/UI-TARS-72B-SFT
Image-Text-to-Text
•
73B
•
Updated
Jan 25
•
47
•
23
roleplaiapp/Omni-Reasoner-2B-Q2_K-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
37
roleplaiapp/Omni-Reasoner-2B-Q8_0-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
11
roleplaiapp/Omni-Reasoner-2B-Q6_K-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
1
roleplaiapp/Omni-Reasoner-2B-Q4_0-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
23
roleplaiapp/Omni-Reasoner-2B-Q3_K_S-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
4
roleplaiapp/Omni-Reasoner-2B-Q3_K_M-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
9
roleplaiapp/Omni-Reasoner-2B-Q3_K_L-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
13
roleplaiapp/Omni-Reasoner-2B-Q4_K_S-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
3
roleplaiapp/Omni-Reasoner-2B-Q5_0-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
5
roleplaiapp/Omni-Reasoner-2B-Q5_K_M-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
15
roleplaiapp/Omni-Reasoner-2B-Q5_K_S-GGUF
Image-Text-to-Text
•
2B
•
Updated
Jan 20
•
3
OpenVINO/Phi-3.5-vision-instruct-fp16-ov
Image-Text-to-Text
•
Updated
Aug 21
•
116
OpenVINO/Phi-3.5-vision-instruct-int8-ov
Image-Text-to-Text
•
Updated
Mar 18
•
1.25k
•
2
OpenVINO/Phi-3.5-vision-instruct-int4-ov
Image-Text-to-Text
•
Updated
Jul 22
•
1.47k
DetionDX/llava-v1.5-13b
Image-Text-to-Text
•
13B
•
Updated
Jan 21
•
3
Previous
1
...
67
68
69
70
71
...
100
Next