Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
microsoft
/
VibeVoice-1.5B
like
1.56k
Follow
Microsoft
15.2k
Text-to-Speech
Transformers
Safetensors
English
Chinese
vibevoice
text-generation
Podcast
arxiv:
2508.19205
arxiv:
2412.08635
License:
mit
Model card
Files
Files and versions
xet
Community
33
Train
Deploy
Use this model
refs/pr/16
VibeVoice-1.5B
Ctrl+K
Ctrl+K
5 contributors
History:
15 commits
Mkshmk
Upload leaves.csv
d64c721
verified
11 days ago
figures
update README
14 days ago
.gitattributes
Safe
1.6 kB
update README
14 days ago
README.md
Safe
7.3 kB
Update README.md
11 days ago
config.json
Safe
2.76 kB
Upload VibeVoice 1.5B model
14 days ago
leaves.csv
63 Bytes
Upload leaves.csv
11 days ago
model-00001-of-00003.safetensors
Safe
1.98 GB
xet
Upload VibeVoice 1.5B model
14 days ago
model-00002-of-00003.safetensors
Safe
1.98 GB
xet
Upload VibeVoice 1.5B model
14 days ago
model-00003-of-00003.safetensors
Safe
1.45 GB
xet
Upload VibeVoice 1.5B model
14 days ago
model.safetensors.index.json
Safe
123 kB
Upload VibeVoice 1.5B model
14 days ago
preprocessor_config.json
Safe
351 Bytes
Upload VibeVoice 1.5B model
14 days ago