Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Paper • 2403.09622 • Published Mar 14 • 16
CCEdit: Creative and Controllable Video Editing via Diffusion Models Paper • 2309.16496 • Published Sep 28, 2023 • 9
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Paper • 2404.02905 • Published Apr 3 • 65