26 Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies · 3 authors
10 On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation · 4 authors