VLM-R1 model for Open-Vocabulary Object Detection
Identify and highlight regions in an image based on text description
Open Agent Leaderboard