Hugging Face
Models
151
Sort: Trending
Active filters:
audio-text-to-text
mradermacher/Qwen2-Audio-7B-Instruct-GGUF • Audio-Text-to-Text • 8B • Updated Jul 31 • 445
mradermacher/Qwen2-Audio-7B-Instruct-i1-GGUF • Audio-Text-to-Text • 8B • Updated Jul 11 • 322 • 1
mradermacher/Qwen2-Audio-7B-GGUF • Audio-Text-to-Text • 8B • Updated Jul 31 • 230 • 1
mradermacher/Qwen2-Audio-7B-i1-GGUF • Audio-Text-to-Text • 8B • Updated Jul 11 • 323 • 1
Akshay3399/Chiiki • Audio-Text-to-Text • Updated Jun 5
ahmedferah/darija_model • Audio-Text-to-Text • 37.8M • Updated Jun 5
fixie-ai/ultravox-v0_6-llama-3_1-8b • Audio-Text-to-Text • 0.7B • Updated Jul 5 • 5.68k • 3
fixie-ai/ultravox-v0_6-gemma-3-27b • Audio-Text-to-Text • 0.7B • Updated Sep 12 • 634 • 8
fixie-ai/ultravox-v0_6-qwen-3-32b • Audio-Text-to-Text • 0.7B • Updated Sep 12 • 936 • 11
mispeech/midashenglm-7b-0804-fp32 • Audio-Text-to-Text • 8B • Updated 30 days ago • 32.8k • 74
DeSTA-ntu/DeSTA2.5-Audio-Llama-3.1-8B • Audio-Text-to-Text • 0.1B • Updated Oct 15 • 1.06k • 6
ICTNLP/StreamUni-Phi4 • Audio-Text-to-Text • 6B • Updated Jul 14 • 6
warshanks/Voxtral-Mini-3B-2507-FP8 • Audio-Text-to-Text • Updated Jul 17 • 1
MohamedRashad/Voxtral-Mini-3B-2507-transformers • Audio-Text-to-Text • 5B • Updated Jul 18 • 56 • 4
MohamedRashad/Voxtral-Small-24B-2507-transformers • Audio-Text-to-Text • 24B • Updated Jul 18 • 44 • 2
bigdefence/bigvox • Audio-Text-to-Text • 2B • Updated Jul 19 • 2 • 1
bigdefence/Bigvox-HyperCLOVAX-Audio • Audio-Text-to-Text • 1B • Updated Aug 23 • 5 • 1
AINovice2005/Voxtral-Mini-3B-2507-smashed • Audio-Text-to-Text • Updated Jul 24
urroxyz/Voxtral-Mini-3B-2507_timestamped • Audio-Text-to-Text • Updated Jul 27 • 2
bigdefence/Bigvox-Kanana-Audio • Audio-Text-to-Text • 3B • Updated Aug 23 • 4 • 1
bartowski/mistralai_Voxtral-Mini-3B-2507-GGUF • Audio-Text-to-Text • 4B • Updated Jul 28 • 3.4k • 10
stduhpf/Voxtral-Small-24B-2507-GGUF • Audio-Text-to-Text • 24B • Updated Jul 28 • 30 • 2
allenai/OLMoASR • Audio-Text-to-Text • Updated Aug 28 • 66
bigdefence/Bigvox-Midm-Audio • Audio-Text-to-Text • 3B • Updated Aug 23 • 5 • 1
nvidia/audio-flamingo-2-SoundCoT • Audio-Text-to-Text • Updated Aug 28 • 9
YirongSun/LLaSO-Base-3.8B-Instruct • Audio-Text-to-Text • 4B • Updated Aug 25 • 10 • 2
Gapeleon/Voxtral-Small-24B-2507 • Audio-Text-to-Text • 24B • Updated Aug 5 • 4
bilguun/gemma-3n-E2B-it-audio-en-mn • Audio-Text-to-Text • Updated Aug 8 • 1
Yi3852/MuFun-Instruct • Audio-Text-to-Text • 9B • Updated Aug 28 • 58
Yi3852/MuFun-ACEStep • Audio-Text-to-Text • 9B • Updated Aug 21 • 90 • 2