27 Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs · 6 authors 1
19 Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers · 6 authors 1
8 StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion · 7 authors 1
6 UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures · 4 authors 1