24 CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data · 8 authors 2
11 ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning · 6 authors 1