Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Cerebras
Nebius AI
Novita
Fireworks
Together AI
fal
Groq
Featherless AI
+ 8
Apply filters
Models
6,698
Full-text search
Edit filters
Sort: Trending
Active filters:
image-text-to-text
Clear all
kpkom/Florence-2-FT-DocVQA
Image-Text-to-Text
•
0.3B
•
Updated
Oct 9, 2024
•
3
smishah774/Florence-2-FT-DocVQA-smits
Image-Text-to-Text
•
0.3B
•
Updated
Sep 15, 2024
•
3
mallapraveen/GOT-OCR2_0
Image-Text-to-Text
•
0.7B
•
Updated
Sep 15, 2024
•
10
smishah774/Florence-2-FT-amazone
Image-Text-to-Text
•
0.3B
•
Updated
Sep 15, 2024
•
3
royleibov/Qwen2-VL-7B-Instruct-ZipNN-Compressed
Image-Text-to-Text
•
Updated
Sep 15, 2024
•
5
•
1
Sarvesh2003/florence_ft_base_ankush1
Image-Text-to-Text
•
0.3B
•
Updated
Sep 15, 2024
•
3
Sarvesh2003/florence_ft_base_ankush2
Image-Text-to-Text
•
0.3B
•
Updated
Sep 15, 2024
•
2
Sarvesh2003/florence_ft_base_ankush3
Image-Text-to-Text
•
0.3B
•
Updated
Sep 16, 2024
•
3
Sarvesh2003/florence_ft_base_ankush4
Image-Text-to-Text
•
0.3B
•
Updated
Sep 16, 2024
•
4
Sarvesh2003/florence_ft_base_ankush5
Image-Text-to-Text
•
0.3B
•
Updated
Sep 16, 2024
•
2
Ansh007/DocVQA
Image-Text-to-Text
•
0.8B
•
Updated
Sep 16, 2024
•
2
Akshath123/DamageCarModel
Image-Text-to-Text
•
0.3B
•
Updated
Sep 16, 2024
•
2
llava-hf/llava-onevision-qwen2-7b-ov-chat-hf
Image-Text-to-Text
•
8B
•
Updated
Jun 18
•
1.81k
•
5
llava-hf/llava-onevision-qwen2-72b-ov-chat-hf
Image-Text-to-Text
•
73B
•
Updated
Jun 18
•
1.3k
•
3
fredaddy/MiniCPM-v-2_6
Image-Text-to-Text
•
Updated
Sep 19, 2024
•
4
benchang1110/TaiVisionLM-base-v2
Image-Text-to-Text
•
1B
•
Updated
Sep 25, 2024
•
44
•
4
Qwen/Qwen2-VL-72B-Instruct
Image-Text-to-Text
•
73B
•
Updated
Feb 6
•
9.91k
•
•
307
Qwen/Qwen2-VL-72B-Instruct-AWQ
Image-Text-to-Text
•
13B
•
Updated
Sep 25, 2024
•
1.36k
•
50
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int4
Image-Text-to-Text
•
13B
•
Updated
Sep 24, 2024
•
1.52k
•
28
Qwen/Qwen2-VL-72B-Instruct-GPTQ-Int8
Image-Text-to-Text
•
22B
•
Updated
Sep 24, 2024
•
1.06k
•
11
erax-ai/EraX-VL-7B-V1.0
Image-Text-to-Text
•
8B
•
Updated
Jan 15
•
1.14k
•
41
Mujtaba007/Florence-2-FT-DocVQA-MUJTABA-DRONE
Image-Text-to-Text
•
0.3B
•
Updated
Sep 17, 2024
•
4
natong19/Qwen2-VL-7B-Instruct-abliterated
Image-Text-to-Text
•
8B
•
Updated
Sep 18, 2024
•
5
arvisioncode/florence_custom_uom1
Image-Text-to-Text
•
0.8B
•
Updated
Sep 18, 2024
•
2
AIDC-AI/Ovis1.6-Gemma2-9B
Image-Text-to-Text
•
10B
•
Updated
Aug 15
•
856
•
276
fredaddy/Qwen-VL-7B-2
Image-Text-to-Text
•
8B
•
Updated
Sep 19, 2024
•
4
meta-llama/Llama-3.2-90B-Vision
Image-Text-to-Text
•
89B
•
Updated
Sep 27, 2024
•
5.06k
•
131
SiyuanH/UniAff-13B
Image-Text-to-Text
•
Updated
Oct 7, 2024
•
3
AIML-TUDA/LlavaGuard-v1.1-7B-hf
Image-Text-to-Text
•
7B
•
Updated
Apr 22
•
71
•
3
AIML-TUDA/LlavaGuard-v1.0-13B-hf
Image-Text-to-Text
•
13B
•
Updated
Apr 22
•
4
•
2
Previous
1
...
37
38
39
40
41
...
100
Next