A collection of papers about image encoder + text decoder for document AI.
Inui
Norm
AI & ML interests
Stable Diffusion; Large Language Model; Object Detection; OCR
Organizations
Collections
4
Image Generation with diffusion-based methods & tricks for stable diffusion
-
BLIP-Diffusion: Pre-trained Subject Representation for Controllable Text-to-Image Generation and Editing
Paper • 2305.14720 • Published • 1 -
GlyphControl: Glyph Conditional Control for Visual Text Generation
Paper • 2305.18259 • Published • 1 -
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Paper • 2309.05793 • Published • 50 -
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion
Paper • 2310.03502 • Published • 74
models
2
datasets
None public yet