Article: A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality • about 1 month ago • 71
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 7 days ago • 55
LLaVA-φ: Efficient Multi-Modal Assistant with Small Language Model Paper • 2401.02330 • Published Jan 4, 2024 • 17