SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 8 days ago • 118
view article Article PaliGemma 2 Mix - New Instruction Vision Language Models by Google 10 days ago • 60
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 790
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated Dec 13, 2024 • 145