Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Paper • 2412.04424 • Published Dec 5, 2024 • 59
microsoft/LLM2CLIP-Openai-L-14-336 Zero-Shot Classification • Updated Nov 24, 2024 • 7.75k • 34