ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement Paper • 2504.01934 • Published 9 days ago • 20
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation Paper • 2403.08857 • Published Mar 13, 2024 • 3
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Paper • 2405.08748 • Published May 14, 2024 • 25
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining Paper • 2303.02489 • Published Mar 4, 2023