-
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Paper • 2404.16821 • Published • 57 -
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings
Paper • 2404.16820 • Published • 17 -
MoDE: CLIP Data Experts via Clustering
Paper • 2404.16030 • Published • 14
atticus sims
atticusS
·
AI & ML interests
None yet
Organizations
Collections
1
spaces
1
models
None public yet
datasets
None public yet