Submitted by akhaliq 28 Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression · 17 authors 1
Submitted by akhaliq 28 Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning · 10 authors 3
Submitted by akhaliq 20 LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching · 6 authors 1
Submitted by akhaliq 18 Memory Augmented Language Models through Mixture of Word Experts · 5 authors 1
Submitted by akhaliq 16 AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort · 6 authors 3
Submitted by akhaliq 10 ProAgent: From Robotic Process Automation to Agentic Process Automation · 12 authors 1
Submitted by akhaliq 8 TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems · 12 authors 2
Submitted by akhaliq 6 M²UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models · 4 authors 1
Submitted by akhaliq 6 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration · 5 authors 1