ACE++: Instruction-Based Image Creation and Editing via Context-Aware Content Filling Paper β’ 2501.02487 β’ Published Jan 5
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers Paper β’ 2310.05400 β’ Published Oct 9, 2023 β’ 1
Eliminating Lipschitz Singularities in Diffusion Models Paper β’ 2306.11251 β’ Published Jun 20, 2023
StyleBooth: Image Style Editing with Multimodal Instruction Paper β’ 2404.12154 β’ Published Apr 18, 2024
Composer: Creative and Controllable Image Synthesis with Composable Conditions Paper β’ 2302.09778 β’ Published Feb 20, 2023
Group Diffusion Transformers are Unsupervised Multitask Learners Paper β’ 2410.15027 β’ Published Oct 19, 2024
ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer Paper β’ 2410.00086 β’ Published Sep 30, 2024 β’ 12
CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases Paper β’ 2408.03910 β’ Published Aug 7, 2024 β’ 18
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning Paper β’ 2406.14130 β’ Published Jun 20, 2024 β’ 10