Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements Paper • 2210.01970 • Published Sep 30, 2022 • 11
Datasets: A Community Library for Natural Language Processing Paper • 2109.02846 • Published Sep 7, 2021 • 10
HuggingFace's Transformers: State-of-the-art Natural Language Processing Paper • 1910.03771 • Published Oct 9, 2019 • 16
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference Paper • 1704.05426 • Published Apr 18, 2017 • 1
XNLI: Evaluating Cross-lingual Sentence Representations Paper • 1809.05053 • Published Sep 13, 2018 • 1
SQuALITY: Building a Long-Document Summarization Dataset the Hard Way Paper • 2205.11465 • Published May 23, 2022 • 1
Does Putting a Linguist in the Loop Improve NLU Data Collection? Paper • 2104.07179 • Published Apr 15, 2021 • 1
The Pile: An 800GB Dataset of Diverse Text for Language Modeling Paper • 2101.00027 • Published Dec 31, 2020 • 6
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages Paper • 2303.12582 • Published Mar 22, 2023 • 20
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 28
Running on CPU Upgrade 146 🏆 Open Portuguese LLM Leaderboard Track, rank and evaluate open LLMs in Portuguese