Project Page: https://jixiaozhong.github.io/Sonic/
ComfyUI: https://github.com/smthemex/ComfyUI_Sonic
Kadir Nar PRO
kadirnar
AI & ML interests
AI Research Engineer ๐ค Building Omni & TTS Models
Recent Activity
Organizations
kadirnar's activity

replied to
their
post
14 days ago
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis
Paper
โข
2501.04561
โข
Published
โข
16
โข
4
DiTAR: Diffusion Transformer Autoregressive Modeling for Speech Generation
Paper
โข
2502.03930
โข
Published
โข
1
Update README.md
#2 opened 17 days ago
by
MateoSP
