ChartThinker: A Contextual Chain-of-Thought Approach to Optimized Chart Summarization Paper • 2403.11236 • Published Mar 17, 2024 • 1
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance Paper • 2407.06937 • Published Jul 9, 2024 • 1
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding Paper • 2405.08748 • Published May 14, 2024 • 19
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation Paper • 2403.08857 • Published Mar 13, 2024 • 3
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining Paper • 2303.02489 • Published Mar 4, 2023