Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Cerebras
Nebius AI
Fireworks
Novita
Together AI
fal
Groq
Featherless AI
+ 8
Apply filters
Models
6,726
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
google/paligemma2-10b-pt-224-jax
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
1
google/paligemma2-10b-pt-448-jax
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
1
google/paligemma2-10b-pt-896-jax
Image-Text-to-Text
•
Updated
Dec 5, 2024
•
2
•
2
google/paligemma2-28b-mix-448
Image-Text-to-Text
•
28B
•
Updated
Feb 7
•
1.87k
•
27
google/paligemma2-28b-pt-224
Image-Text-to-Text
•
28B
•
Updated
Dec 5, 2024
•
38
•
7
google/paligemma2-28b-pt-448
Image-Text-to-Text
•
28B
•
Updated
Dec 5, 2024
•
26
•
10
morthens/qwen2-vl-inference
Image-Text-to-Text
•
3B
•
Updated
Nov 29, 2024
•
6
AIML-TUDA/LlavaGuard-v1.2-0.5B-OV
Image-Text-to-Text
•
0.9B
•
Updated
Jan 17
•
5
•
2
AIML-TUDA/LlavaGuard-v1.2-0.5B-OV-hf
Image-Text-to-Text
•
0.9B
•
Updated
Jan 17
•
7.72k
•
4
google/paligemma2-28b-pt-896
Image-Text-to-Text
•
28B
•
Updated
Dec 5, 2024
•
1.8k
•
49
HuggingFaceTB/SmolVLM-Base
Image-Text-to-Text
•
2B
•
Updated
Nov 28, 2024
•
4.36k
•
81
HuggingFaceTB/SmolVLM-Synthetic
Image-Text-to-Text
•
2B
•
Updated
Nov 26, 2024
•
20
•
12
google/paligemma2-28b-mix-224
Image-Text-to-Text
•
28B
•
Updated
Feb 7
•
2.17k
•
4
andrewqian123/MiniCPM-V-2_6
Image-Text-to-Text
•
8B
•
Updated
Nov 29, 2024
•
4
stepfun-ai/GOT-OCR-2.0-hf
Image-Text-to-Text
•
0.6B
•
Updated
Jan 31
•
56.7k
•
213
gwkrsrch/Elva-Phi3-3.8B
Image-Text-to-Text
•
4B
•
Updated
Nov 23, 2024
•
7
•
1
OmNagvekar/florence-2-fine-tunned-cleanliness
Image-Text-to-Text
•
0.3B
•
Updated
Nov 23, 2024
•
2
mlx-community/Florence-2-large-ft-6bit
Image-Text-to-Text
•
0.2B
•
Updated
Nov 23, 2024
•
13
saujasv/Idefics3-8B-Llama3
Image-Text-to-Text
•
8B
•
Updated
Nov 23, 2024
•
8
mlx-community/Molmo-7B-D-0924-6bit
Image-Text-to-Text
•
2B
•
Updated
Dec 27, 2024
•
10
mlx-community/Florence-2-base-ft-6bit
Image-Text-to-Text
•
0.1B
•
Updated
Nov 24, 2024
•
5
mlx-community/Florence-2-base-ft-3bit
Image-Text-to-Text
•
0.0B
•
Updated
Nov 24, 2024
•
7
mlx-community/Molmo-7B-D-0924-3bit
Image-Text-to-Text
•
1B
•
Updated
Dec 27, 2024
•
13
sagaxlearn/llama3.2-vision
Image-Text-to-Text
•
Updated
Nov 24, 2024
TrgTuan10/llava-v1.6-mistral-7b-hf
Image-Text-to-Text
•
8B
•
Updated
Nov 25, 2024
•
6
zai-org/glm-edge-v-2b
Image-Text-to-Text
•
2B
•
Updated
Jan 2
•
2.39k
•
12
zai-org/glm-edge-v-5b
Image-Text-to-Text
•
5B
•
Updated
Jan 2
•
139
•
12
tadkt/GOT_Vietnamese
Image-Text-to-Text
•
0.6B
•
Updated
May 26
•
23
nmiraz/Florence-2-image-LLM
Image-Text-to-Text
•
0.3B
•
Updated
Nov 24, 2024
•
2
NCSOFT/VARCO-VISION-14B
Image-Text-to-Text
•
15B
•
Updated
17 days ago
•
3.68k
•
36
Previous
1
...
50
51
52
53
54
...
100
Next