Align Your Steps: Optimizing Sampling Schedules in Diffusion Models Paper • 2404.14507 • Published 16 days ago • 21
Efficient Transformer Encoders for Mask2Former-style models Paper • 2404.15244 • Published 15 days ago • 1
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data Paper • 2404.15653 • Published 15 days ago • 24