Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
deepakkumar07 's Collections
vision-llm
tamil-dataset
document-parser
text-to-speech
voice-to-text
Transformers model
csv-dataset

vision-llm

updated Jul 12
Upvote
-

  • Running
    104
    104

    Vision Papers

    💻

    All paper summaries read by Merve


  • Running on Zero
    20
    20

    Ovis2 1B

    🦫

    Small model can do big things.


  • AIDC-AI/Ovis2-8B-GPTQ-Int4

    Image-Text-to-Text • 3B • Updated Mar 25 • 713 • 3

  • AIDC-AI/Ovis2-1B

    Image-Text-to-Text • 1B • Updated 6 days ago • 485k • 92

  • Running on Zero
    13
    13

    Ovis2 8B

    🦫

    Ovis2-8B


  • lambdalabs/Llama-3.3-70B-Instruct-AWQ-4bit

    11B • Updated Dec 10, 2024 • 1.22k • 4

  • microsoft/GUI-Actor-7B-Qwen2-VL

    Image-Text-to-Text • 8B • Updated 12 days ago • 1.07k • 37

  • lambdalabs/sd-image-variations-diffusers

    Image-to-Image • Updated Feb 8, 2023 • 4.55k • 447

  • vikhyatk/moondream2

    Image-Text-to-Text • 2B • Updated Jul 7 • 142k • 1.25k
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略