ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving Paper β’ 2404.16771 β’ Published Apr 25, 2024 β’ 19
IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models Paper β’ 2403.13535 β’ Published Mar 20, 2024 β’ 23
InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation Paper β’ 2404.19427 β’ Published Apr 30, 2024 β’ 75
DynamicID: Zero-Shot Multi-ID Image Personalization with Flexible Facial Editability Paper β’ 2503.06505 β’ Published about 1 month ago β’ 1
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement Paper β’ 2504.01934 β’ Published 6 days ago β’ 20
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning Paper β’ 2504.02949 β’ Published 5 days ago β’ 13
Inference-Time Scaling for Generalist Reward Modeling Paper β’ 2504.02495 β’ Published 5 days ago β’ 42
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation Paper β’ 2504.02782 β’ Published 5 days ago β’ 52
SkyReels-A2: Compose Anything in Video Diffusion Transformers Paper β’ 2504.02436 β’ Published 5 days ago β’ 29
Be Yourself: Bounded Attention for Multi-Subject Text-to-Image Generation Paper β’ 2403.16990 β’ Published Mar 25, 2024 β’ 25
FlashFace: Human Image Personalization with High-fidelity Identity Preservation Paper β’ 2403.17008 β’ Published Mar 25, 2024 β’ 21
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes Paper β’ 2503.23461 β’ Published 9 days ago β’ 87
Synthetic Video Enhances Physical Fidelity in Video Synthesis Paper β’ 2503.20822 β’ Published 14 days ago β’ 16
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework Paper β’ 2503.21758 β’ Published 12 days ago β’ 18
ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model Paper β’ 2503.21144 β’ Published 12 days ago β’ 23
VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness Paper β’ 2503.21755 β’ Published 12 days ago β’ 31
ClassDiffusion: More Aligned Personalization Tuning with Explicit Class Guidance Paper β’ 2405.17532 β’ Published May 27, 2024 β’ 1
LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis Paper β’ 2503.21749 β’ Published 12 days ago β’ 25