-
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Paper • 2505.02707 • Published • 86 -
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Paper • 2505.02823 • Published • 5 -
PixelHacker: Image Inpainting with Structural and Semantic Consistency
Paper • 2504.20438 • Published • 44 -
Improving Editability in Image Generation with Layer-wise Memory
Paper • 2505.01079 • Published • 29
Cedric
cedhons
AI & ML interests
None yet
Organizations
None yet
Computer-Vison
-
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play
Paper • 2505.02707 • Published • 86 -
MUSAR: Exploring Multi-Subject Customization from Single-Subject Dataset via Attention Routing
Paper • 2505.02823 • Published • 5 -
PixelHacker: Image Inpainting with Structural and Semantic Consistency
Paper • 2504.20438 • Published • 44 -
Improving Editability in Image Generation with Layer-wise Memory
Paper • 2505.01079 • Published • 29
models
0
None public yet
datasets
0
None public yet