Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published 1 day ago • 22
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements Paper • 2411.12044 • Published 4 days ago • 13