Submitted by akhaliq 24 Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning · 10 authors 3
Submitted by akhaliq 22 Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression · 17 authors 1
Submitted by akhaliq 16 LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching · 6 authors 1
Submitted by akhaliq 16 Memory Augmented Language Models through Mixture of Word Experts · 5 authors 1
Submitted by akhaliq 14 AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort · 6 authors 2
Submitted by akhaliq 8 ProAgent: From Robotic Process Automation to Agentic Process Automation · 12 authors 1
Submitted by akhaliq 6 TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems · 12 authors 2
Submitted by akhaliq 4 GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration · 5 authors 1
Submitted by akhaliq 3 M$^{2}$UGen: Multi-modal Music Understanding and Generation with the Power of Large Language Models · 4 authors 1