loki hojulnir's picture

loki hojulnir

DoesntKnowAI

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago
tiiuae/Falcon3-demo
liked a model 1 day ago
Xiaojian9992024/SmolMoE-4x360M
liked a dataset 1 day ago
AIDC-AI/Ovis-dataset
View all activity

Organizations

a11a1's profile picture

DoesntKnowAI's activity

reacted to Xenova's post with ๐Ÿ”ฅ 15 days ago
view post
Post
7136
We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. โšก๏ธ

Generate 10 seconds of speech in ~1 second for $0.

What will you build? ๐Ÿ”ฅ
webml-community/kokoro-webgpu

The most difficult part was getting the model running in the first place, but the next steps are simple:
โœ‚๏ธ Implement sentence splitting, allowing for streamed responses
๐ŸŒ Multilingual support (only phonemization left)

Who wants to help?
ยท