Submitted by akhaliq 55 Amphion: An Open-Source Audio, Music and Speech Generation Toolkit · 13 authors 5
Submitted by akhaliq 39 ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent · 13 authors 1
Submitted by akhaliq 27 DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models · 6 authors 2
Submitted by akhaliq 16 Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models · 8 authors 1
Submitted by akhaliq 15 Extending Context Window of Large Language Models via Semantic Compression · 7 authors 1
Submitted by akhaliq 9 Faithful Persona-based Conversational Dataset Generation with Large Language Models · 5 authors 1