PaLI-X: On Scaling up a Multilingual Vision and Language Model Paper • 2305.18565 • Published May 29, 2023 • 3
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Paper • 2404.12387 • Published Apr 18 • 38
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023 • 28