Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
VoxCPM
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 42
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
Safetensors
ONNX
GGUF
Transformers.js
MLX
Keras
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 13
Inference Providers
Fireworks
Cerebras
Novita
Nebius AI
Together AI
fal
Nscale
Groq
+ 9
Apply filters
Models
8,838
Full-text search
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
sbintuitions/sarashina2-vision-8b
Image-to-Text
•
8B
•
Updated
Mar 27
•
559
•
8
Andres77872/SmolVLM-500M-anime-caption-v0.2
Image-to-Text
•
0.5B
•
Updated
May 12
•
399
•
7
unsloth/Cosmos-Reason1-7B
Image-to-Text
•
8B
•
Updated
May 24
•
12
•
2
PaddlePaddle/PP-OCRv5_server_rec
Image-to-Text
•
Updated
Jul 22
•
132k
•
14
PaddlePaddle/PP-Chart2Table
Image-to-Text
•
Updated
Jul 22
•
4.98k
•
2
alpharomercoma/qwen2.5-vl-7b-ft-latex-f16
Image-to-Text
•
Updated
Jun 15
•
6
•
1
l0wgear/manga-ocr-2025-onnx
Image-to-Text
•
Updated
Jun 30
•
83
•
1
allura-org/MS3.2-24b-Angel
Image-to-Text
•
24B
•
Updated
Sep 3
•
2.57k
•
12
loay/ArabicOCR-Qwen2.5-VL-7B-Vision
Image-to-Text
•
8B
•
Updated
Jul 18
•
451
•
3
allenai/olmOCR-7B-0725-FP8
Image-to-Text
•
8B
•
Updated
Aug 19
•
19.3k
•
18
allenai/olmOCR-7B-0825-FP8
Image-to-Text
•
8B
•
Updated
Aug 13
•
150k
•
6
ob11/Qwen-VL-PRM-3B
Image-to-Text
•
4B
•
Updated
6 days ago
•
31
•
1
farbodpya/Persian-OCR
Image-to-Text
•
Updated
7 days ago
•
140
•
2
Zhare-AI/janus-pro-7b-webgpu
Image-to-Text
•
Updated
9 days ago
•
32
•
1
OfficerChul/InfiGUI-G1-3B-Android-Control
Image-to-Text
•
4B
•
Updated
7 days ago
•
17
•
1
OnesimeMoffo/llama3.2vision-finetuned
Image-to-Text
•
11B
•
Updated
7 days ago
•
28
•
1
Muhammadidrees/RaiyaChatDoc
Image-to-Text
•
4B
•
Updated
7 days ago
•
30
•
1
OfficerChul/Qwen2.5-VL-7B-Instruct-Android-Control
Image-to-Text
•
8B
•
Updated
3 days ago
•
16
•
1
thesby/Qwen2.5-VL-7B-NSFW-Caption-V3
Image-to-Text
•
8B
•
Updated
Jun 17
•
3.47k
•
78
MahsaShahidi/Persian-Image-Captioning
Image-to-Text
•
Updated
Feb 22, 2022
•
25
•
2
adalbertojunior/image_captioning_portuguese
Image-to-Text
•
Updated
Jul 17, 2024
•
18
•
1
danasone/testpush
Image-to-Text
•
Updated
Jan 1, 2022
•
5
g8a9/vit-geppetto-captioning
Image-to-Text
•
Updated
Nov 29, 2021
•
6
gagan3012/ViTGPT2I2A
Image-to-Text
•
Updated
Feb 8, 2022
•
5
gagan3012/ViTGPT2_VW
Image-to-Text
•
Updated
Feb 7, 2022
•
15
gagan3012/ViTGPT2_vizwiz
Image-to-Text
•
Updated
Feb 7, 2022
•
24
•
1
keras-io/image-captioning
Image-to-Text
•
Updated
Jan 13, 2022
•
8
keras-io/ocr-for-captcha
Image-to-Text
•
Updated
May 29, 2022
•
89
•
80
microsoft/trocr-base-printed
Image-to-Text
•
0.3B
•
Updated
May 27, 2024
•
349k
•
190
microsoft/trocr-base-stage1
Image-to-Text
•
0.4B
•
Updated
May 27, 2024
•
35.3k
•
14
Previous
1
2
3
4
...
100
Next