Demo | Paper | arXiv | GitHub
We are a team from AI2, UCSB, and UWaterloo, and we are working on benchmarking vision language models.
Compare VLMs at WildVision-Arena and WildVision-Bench.
More chat and vote data will be added regularly. The evaluation script is released here: WildVision-Bench
Contact: Bill Yuchen Lin (yuchenl@allenai.org) and Yujie Lu (yujielu@ucsb.edu)
Citation: If you find this Hugging Face Space useful, please consider citing us:
@misc{lu2024wildvision,
title={WildVision: Evaluating Vision-Language Models in the Wild with Human Preferences},
author={Yujie Lu and Dongfu Jiang and Wenhu Chen and William Yang Wang and Yejin Choi and Bill Yuchen Lin},
year={2024},
eprint={2406.11069},
archivePrefix={arXiv},
primaryClass={cs.CV}
}