Edit Models filters

Tasks

Text Generation

Image-Text-to-Text

Parameters

Libraries

Transformers.js

Apps

Inference Providers

Models

6,587

Full-text search

Active filters: image-text-to-text

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated 4 days ago • 17.5k • 505

moondream/moondream3-preview

Image-Text-to-Text • 9B • Updated 3 days ago • 2.56k • 207

Hcompany/Holo1.5-7B

Image-Text-to-Text • 8B • Updated 7 days ago • 853 • 77

ibm-granite/granite-docling-258M-mlx

Image-Text-to-Text • 0.3B • Updated 5 days ago • 2.21k • 33

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated about 7 hours ago • 301k • 975

tencent/POINTS-Reader

Image-Text-to-Text • 4B • Updated 11 days ago • 4.05k • 91

openbmb/MiniCPM-V-4_5

Image-Text-to-Text • 9B • Updated 7 days ago • 72.4k • 965

baidu/Qianfan-VL-8B

Image-Text-to-Text • 9B • Updated 3 days ago • 13 • 24

baidu/Qianfan-VL-70B

Image-Text-to-Text • 72B • Updated 3 days ago • 1 • 24

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated 5 days ago • 1.84k • 22

baidu/Qianfan-VL-3B

Image-Text-to-Text • 4B • Updated 3 days ago • 10 • 20

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 4.35M • • 1.25k

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Aug 18 • 34.8k • • 652

OpenGVLab/ScaleCUA-32B

Image-Text-to-Text • 33B • Updated 5 days ago • 151 • 13

ds4sd/SmolDocling-256M-preview

Image-Text-to-Text • 0.3B • Updated 5 days ago • 160k • 1.58k

Hcompany/Holo1.5-3B

Image-Text-to-Text • 4B • Updated 7 days ago • 307 • 29

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 1.8M • 853

google/medgemma-4b-it

Image-Text-to-Text • 5B • Updated Jul 9 • 102k • 669

fancyfeast/llama-joycaption-beta-one-hf-llava

Image-Text-to-Text • 8B • Updated May 16 • 76.7k • 214

google/gemma-3n-E4B-it

Image-Text-to-Text • 8B • Updated Jul 14 • 211k • 772

OpenGVLab/ScaleCUA-3B

Image-Text-to-Text • 4B • Updated 5 days ago • 1.12k • 9

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 943k • • 1.62k

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • 109B • Updated May 22 • 554k • • 1.09k

vikhyatk/moondream2

Image-Text-to-Text • 2B • Updated 4 days ago • 186k • 1.3k

microsoft/Florence-2-large

Image-Text-to-Text • 0.8B • Updated Aug 4 • 708k • 1.67k

google/gemma-3-12b-it

Image-Text-to-Text • 12B • Updated Mar 21 • 515k • • 527

ByteDance-Seed/UI-TARS-1.5-7B

Image-Text-to-Text • 8B • Updated Apr 18 • 73.8k • 398

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20 • 254k • 1.51k

Hcompany/Holo1.5-72B

Image-Text-to-Text • 73B • Updated 7 days ago • 76 • 21

OpenGVLab/ScaleCUA-7B

Image-Text-to-Text • 8B • Updated 5 days ago • 738 • 6