laion/CLIP-ViT-H-14-laion2B-s32B-b79K Zero-Shot Image Classification β’ Updated 27 days ago β’ 996k β’ 356
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Paper β’ 2501.12224 β’ Published 27 days ago β’ 46
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper β’ 2501.12380 β’ Published 27 days ago β’ 82
Salesforce/xgen-mm-phi3-mini-instruct-dpo-r-v1.5 Image-Text-to-Text β’ Updated 15 days ago β’ 42 β’ 17
HuggingFaceM4/siglip-so400m-14-980-flash-attn2-navit Zero-Shot Image Classification β’ Updated Mar 7, 2024 β’ 5.27k β’ 44
ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding Paper β’ 2501.05452 β’ Published Jan 9 β’ 15
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper β’ 2501.05441 β’ Published Jan 9 β’ 88