openai/clip-vit-large-patch14-336 Zero-Shot Image Classification • Updated Oct 4, 2022 • 5.83M • 240
MedVLM-R1: Incentivizing Medical Reasoning Capability of Vision-Language Models (VLMs) via Reinforcement Learning Paper • 2502.19634 • Published Feb 26, 2025 • 63
VLM R1 Referral Expression: Mark regions in images based on text descriptions Space • Running on Zero • 62
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper • 2307.02486 • Published Jul 5, 2023 • 80