Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
SoraWatermarkRemover
Log In
Sign Up
microsoft
/
Phi-4-multimodal-instruct
like
1.55k
Follow
Microsoft
17k
Automatic Speech Recognition
Transformers
Safetensors
24 languages
phi4mm
text-generation
nlp
code
audio
speech-summarization
speech-translation
visual-question-answering
phi-4-multimodal
phi
phi-4-mini
custom_code
arxiv:
2503.01743
arxiv:
2407.13833
License:
mit
Model card
Files
Files and versions
xet
Community
86
Deploy
Use this model
bd4b39b
Phi-4-multimodal-instruct
/
examples
/
what_is_shown_in_this_image.wav
nguyenbh
Add examples
bd4b39b
10 months ago
download
Copy download link
history
Safe
113 kB
This file contains binary data. It cannot be displayed, but you can still
download
it.