Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Cerebras
Nebius AI
Fireworks
Novita
Together AI
fal
Groq
Featherless AI
+ 8
Apply filters
Models
6,723
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
ChengyouJia/ChatGen-Base-4B
Image-Text-to-Text
•
4B
•
Updated
Nov 29, 2024
•
2
•
2
ChengyouJia/ChatGen-Base-8B
Image-Text-to-Text
•
8B
•
Updated
Nov 29, 2024
•
8
•
2
YipengZhang/LLaVA-UHD-v2-Vicuna-7B
Image-Text-to-Text
•
8B
•
Updated
Mar 31
•
9
•
6
HuggingFaceTB/SmolVLM-Instruct-DPO
Image-Text-to-Text
•
Updated
Nov 26, 2024
•
16
•
22
OpenGVLab/InternVL2-Pretrain-Models
Image-Text-to-Text
•
Updated
Mar 25
•
11
mlx-community/SmolVLM-Instruct-4bit
Image-Text-to-Text
•
0.5B
•
Updated
Nov 29, 2024
•
1.07k
•
5
mlx-community/SmolVLM-Instruct-6bit
Image-Text-to-Text
•
0.6B
•
Updated
Nov 29, 2024
•
7
mlx-community/SmolVLM-Instruct-8bit
Image-Text-to-Text
•
0.7B
•
Updated
Nov 29, 2024
•
34
•
9
mlx-community/SmolVLM-Instruct-bf16
Image-Text-to-Text
•
2B
•
Updated
Nov 29, 2024
•
18
•
5
shi-labs/OLA-VLM-CLIP-ViT-Llama3-8b
Image-Text-to-Text
•
8B
•
Updated
Dec 10, 2024
•
5
shi-labs/OLA-VLM-CLIP-ViT-Phi3-4k-mini
Image-Text-to-Text
•
4B
•
Updated
Dec 10, 2024
•
4
•
1
shi-labs/OLA-VLM-CLIP-ConvNeXT-Llama3-8b
Image-Text-to-Text
•
9B
•
Updated
Dec 10, 2024
•
5
•
1
shi-labs/OLA-VLM-CLIP-ConvNeXT-Phi3-4k-mini
Image-Text-to-Text
•
5B
•
Updated
Dec 10, 2024
•
7
•
1
mlx-community/Idefics3-8B-Llama3-bf16
Image-Text-to-Text
•
8B
•
Updated
Jan 30
•
14
NCSOFT/VARCO-VISION-14B-HF
Image-Text-to-Text
•
15B
•
Updated
17 days ago
•
1.12k
•
29
scwoods/Florence-2-FT-DocVQA
Image-Text-to-Text
•
0.3B
•
Updated
Nov 27, 2024
•
3
zdq/GotOcr2-hf-8bit
Image-Text-to-Text
•
0.6B
•
Updated
Nov 27, 2024
•
5
zai-org/glm-edge-v-5b-gguf
Image-Text-to-Text
•
4B
•
Updated
Nov 28, 2024
•
911
•
8
zai-org/glm-edge-v-2b-gguf
Image-Text-to-Text
•
2B
•
Updated
Nov 28, 2024
•
443
•
9
kp-forks/Florence-2-large
Image-Text-to-Text
•
Updated
Dec 8, 2024
•
4
morthens/qwen2-vl-infer
Image-Text-to-Text
•
3B
•
Updated
Nov 29, 2024
•
5
IntJudge/IntJudge
Image-Text-to-Text
•
8B
•
Updated
Nov 29, 2024
•
5
•
2
unsloth/Qwen2-VL-2B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
2B
•
Updated
Mar 9
•
5.5k
•
7
unsloth/Qwen2-VL-7B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
5B
•
Updated
Mar 9
•
10k
•
12
morthens/qwen2-vl-7b-infer
Image-Text-to-Text
•
8B
•
Updated
Nov 29, 2024
•
3
morthens/qwen2-vl-2b-infer
Image-Text-to-Text
•
2B
•
Updated
Nov 29, 2024
•
2
gautamgc17/llama3.2-vlm-torchtune
Image-Text-to-Text
•
11B
•
Updated
Nov 29, 2024
•
3
teamcraft/TeamCraft-VLA-7B-Dec
Image-Text-to-Text
•
7B
•
Updated
Dec 2, 2024
•
27
teamcraft/TeamCraft-VLA-7B-Cen
Image-Text-to-Text
•
7B
•
Updated
Dec 2, 2024
•
136
rhymes-ai/Aria-Base-64K
Image-Text-to-Text
•
25B
•
Updated
Dec 1, 2024
•
8
•
14
Previous
1
...
51
52
53
54
55
...
100
Next