PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published Dec 4, 2024
VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling Paper • 2112.05692 • Published Dec 10, 2021
SCENIC: A JAX Library for Computer Vision Research and Beyond Paper • 2110.11403 • Published Oct 18, 2021
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023
Simple Open-Vocabulary Object Detection with Vision Transformers Paper • 2205.06230 • Published May 12, 2022