Reliable Fidelity and Diversity Metrics for Generative Models Paper • 2002.09797 • Published Feb 23, 2020
SILC: Improving Vision Language Pretraining with Self-Distillation Paper • 2310.13355 • Published Oct 20, 2023 • 9
SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance Paper • 2311.16241 • Published Nov 27, 2023 • 2
Learning Graph Embeddings for Compositional Zero-shot Learning Paper • 2102.01987 • Published Feb 3, 2021
Introducing Language Guidance in Prompt-based Continual Learning Paper • 2308.15827 • Published Aug 30, 2023
GiT: Towards Generalist Vision Transformer through Universal Language Interface Paper • 2403.09394 • Published Mar 14, 2024 • 27
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters Paper • 2410.23168 • Published Oct 30, 2024 • 24
Active Data Curation Effectively Distills Large-Scale Multimodal Models Paper • 2411.18674 • Published Nov 27, 2024 • 1
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 20 days ago • 128