BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes Paper • 2005.04790 • Published May 10, 2020
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality Paper • 2204.03162 • Published Apr 7, 2022
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Paper • 1804.07461 • Published Apr 20, 2018
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems Paper • 1905.00537 • Published May 2, 2019
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language Paper • 2306.16410 • Published Jun 28, 2023
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents Paper • 2306.16527 • Published Jun 21, 2023
Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA Paper • 1911.06258 • Published Nov 14, 2019
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking Paper • 2106.06052 • Published May 21, 2021
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants Paper • 2112.09062 • Published Dec 16, 2021
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks Paper • 2204.01906 • Published Apr 5, 2022