Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Paper • 2403.09622 • Published Mar 14 • 16
CCEdit: Creative and Controllable Video Editing via Diffusion Models Paper • 2309.16496 • Published Sep 28, 2023 • 9
Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering Paper • 2406.10208 • Published Jun 14 • 21
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Paper • 2404.02905 • Published Apr 3 • 65
Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering Paper • 2403.09622 • Published Mar 14 • 16
LISA: Reasoning Segmentation via Large Language Model Paper • 2308.00692 • Published Aug 1, 2023 • 1 • 1