Collections

Collections including paper arxiv:2406.11069

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6 • 25
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6 • 12
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7 • 39
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7 • 20

WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences

Paper • 2406.11069 • Published Jun 16 • 13

Qwen/Qwen-VL

Text Generation • Updated Jan 25 • 13.8k • 219
google/pix2struct-large

Image-to-Text • Updated Sep 6, 2023 • 105k • 34
THUDM/cogagent-chat-hf

Text Generation • Updated 2 days ago • 2.57k • 67
openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • Updated Sep 25 • 28.5k • 1.38k

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16 • 77
Aria Everyday Activities Dataset

Paper • 2402.13349 • Published Feb 20 • 30
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Paper • 2403.04132 • Published Mar 7 • 38
SaulLM-7B: A pioneering Large Language Model for Law

Paper • 2403.03883 • Published Mar 6 • 77
