Collections
Discover the best community collections!
Collections including paper arxiv:2401.01808
-
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution
Paper • 2401.00935 • Published • 16 -
Taming Mode Collapse in Score Distillation for Text-to-3D Generation
Paper • 2401.00909 • Published • 8 -
Q-Refine: A Perceptual Quality Refiner for AI-Generated Image
Paper • 2401.01117 • Published • 6 -
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
Paper • 2401.01173 • Published • 9
-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 26 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper • 2401.01885 • Published • 26 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper • 2401.00604 • Published • 4 -
LARP: Language-Agent Role Play for Open-World Games
Paper • 2312.17653 • Published • 29
-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 26 -
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Paper • 2401.05252 • Published • 43 -
Scalable Pre-training of Large Autoregressive Image Models
Paper • 2401.08541 • Published • 35 -
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Paper • 2401.09417 • Published • 51
-
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper • 2312.02155 • Published • 11 -
LivePhoto: Real Image Animation with Text-guided Motion Control
Paper • 2312.02928 • Published • 15 -
FaceStudio: Put Your Face Everywhere in Seconds
Paper • 2312.02663 • Published • 27 -
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 26