AXERA-TECH's Collections
Multimodal Models

Updated about 22 hours ago

  • AXERA-TECH/lcm-lora-sdv1-5

    Updated Jun 23 • 5 • 1

  • AXERA-TECH/InternVL3-2B

    Visual Question Answering • Updated 17 days ago • 14 • 2

  • AXERA-TECH/Qwen2.5-VL-3B-Instruct

    Image-Text-to-Text • Updated 15 days ago • 19

  • AXERA-TECH/InternVL3-1B

    Image-Text-to-Text • Updated Jun 28 • 13

  • AXERA-TECH/SmolVLM2-500M-Video-Instruct

    Visual Question Answering • Updated Jul 14 • 7 • 2

  • AXERA-TECH/InternVL2_5-1B-MPO

    Image-Text-to-Text • Updated 13 days ago • 13

  • AXERA-TECH/InternVL2_5-1B

    Image-Text-to-Text • Updated Apr 4 • 4 • 1

  • AXERA-TECH/Janus-Pro-1B

    Visual Question Answering • Updated Apr 14 • 4 • 2

  • AXERA-TECH/SmolVLM-256M-Instruct

    Updated Apr 4 • 12 • 2

  • AXERA-TECH/LivePortrait

    Image-to-Video • Updated Jun 21 • 2 • 4

  • AXERA-TECH/cnclip

    Updated 17 days ago • 10 • 1

  • AXERA-TECH/clip

    Updated 17 days ago • 9

  • AXERA-TECH/Qwen2.5-VL-7B-Instruct

    Image-Text-to-Text • Updated 15 days ago • 10

  • AXERA-TECH/YOLO-World-V2

    Zero-Shot Object Detection • Updated about 23 hours ago • 6