Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper โข 2405.10300 โข Published May 16 โข 26
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper โข 2311.06242 โข Published Nov 10, 2023 โข 84
Learning and Leveraging World Models in Visual Representation Learning Paper โข 2403.00504 โข Published Mar 1 โข 31