Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
FrancisRing
/
StableAvatar
like
64
Image-to-Video
Diffusers
ONNX
Safetensors
WanPipeline
video-generation
video diffusion transformer
audio-driven avatar animation
arxiv:
2508.08248
License:
mit
Model card
Files
Files and versions
xet
Community
5
Use this model
deee8ef
StableAvatar
Ctrl+K
Ctrl+K
3 contributors
History:
17 commits
FrancisRing
Upload models_t5_umt5-xxl-enc-bf16.pth
deee8ef
verified
9 days ago
StableAvatar-1.3B
Upload 2 files
9 days ago
Wan2.1-Fun-V1.1-1.3B-InP
Upload models_t5_umt5-xxl-enc-bf16.pth
9 days ago
assets
Upload 12 files
9 days ago
wav2vec2-base-960h
Upload 9 files
9 days ago
.gitattributes
Safe
2.47 kB
Upload 12 files
9 days ago
Kim_Vocal_2.onnx
Safe
66.8 MB
xet
Upload Kim_Vocal_2.onnx
9 days ago
README.md
25.5 kB
Update README.md
9 days ago
config.json
Safe
1.03 kB
Upload config.json
9 days ago