Latest SOTA models supported on Qualcomm NPU.
AI & ML interests
On Device AI Deployment and Research
Recent Activity
Organization Card
Welcome to Nexa AI org on HuggingFace!
Nexa AI is an on device AI deployment and research company. We craft optimized foundation models and on-device inference framework that runs any model on any device, across any backend—within minutes. Our mission is to make on device AI friction‑free and production‑ready.
On this page you’ll find
- Our own trained checkpoints
- Hand‑picked community models in GGUF or MLX formats, ready to run on nexa-sdk
Resources
- ⚙️ Download nexaSDK – get up and run models locally in minutes
- 💬 Discord Community
- 💼 Slack Community
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
-
NexaAI/gemma-3n-E4B-it-4bit-MLX
Image-Text-to-Text • Updated • 122 • 1 -
NexaAI/Qwen2.5-VL-7B-Instruct-4bit-MLX
Image-Text-to-Text • 2B • Updated • 76 -
NexaAI/SmolVLM-500M-Instruct-8bit-MLX
Image-Text-to-Text • 0.7B • Updated • 32 -
NexaAI/SmolVLM-Instruct-8bit-MLX
Image-Text-to-Text • 0.7B • Updated • 43
Latest SOTA models supported on Qualcomm NPU.
Language Models that takes vision input and/or audio input, hand picked by Nexa Team.
-
NexaAI/gemma-3n-E4B-it-4bit-MLX
Image-Text-to-Text • Updated • 122 • 1 -
NexaAI/Qwen2.5-VL-7B-Instruct-4bit-MLX
Image-Text-to-Text • 2B • Updated • 76 -
NexaAI/SmolVLM-500M-Instruct-8bit-MLX
Image-Text-to-Text • 0.7B • Updated • 32 -
NexaAI/SmolVLM-Instruct-8bit-MLX
Image-Text-to-Text • 0.7B • Updated • 43
spaces
5
Running
64
Nexa Omni Demo
🎧
Generate text from audio input
Running
79
Omnivlm Dpo Demo
👁
Ask questions about images and get detailed answers
Running
on
CPU Upgrade
29
Open LLM Leaderboard for domains
📊
Ranking for Open-sourced LLMs in different domains
Running
on
CPU Upgrade
32
Nexa AI GGUF Convertor
⚡
Submit a model for quantization and receive an email notification
models
41

NexaAI/OmniVLM-968M
0.5B
•
Updated
•
1.35k
•
522

NexaAI/paddleocr-npu
Updated
•
15

NexaAI/yolov12-npu
Updated
•
17

NexaAI/qwen3-1.7B-npu
Updated
•
18

NexaAI/qwen3-4B-npu
Updated
•
16

NexaAI/OmniNeural-4B
Updated
•
49

NexaAI/Qwen3-4B-GGUF
Text Generation
•
4B
•
Updated
•
283

NexaAI/Qwen3-0.6B-GGUF
Text Generation
•
0.6B
•
Updated
•
4.39k

NexaAI/whisper-large-v3-turbo-MLX
Automatic Speech Recognition
•
Updated
•
104

NexaAI/parakeet-tdt-0.6b-v2-MLX
Automatic Speech Recognition
•
Updated
•
115
•
1