Grounding DINO 1.5: Advance the "Edge" of Open-Set Object Detection Paper โข 2405.10300 โข Published May 16 โข 26
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks Paper โข 2311.06242 โข Published Nov 10, 2023 โข 84
Learning and Leveraging World Models in Visual Representation Learning Paper โข 2403.00504 โข Published Mar 1 โข 31
timm/maxvit_rmlp_base_rw_224.sw_in12k_ft_in1k Image Classification โข Updated May 11, 2023 โข 394 โข 1