RAFT: A Real-World Few-Shot Text Classification Benchmark Paper • 2109.14076 • Published Sep 28, 2021 • 2
GEMv2: Multilingual NLG Benchmarking in a Single Line of Code Paper • 2206.11249 • Published Jun 22, 2022
AfroDigits: A Community-Driven Spoken Digit Dataset for African Languages Paper • 2303.12582 • Published Mar 22, 2023 • 20
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements Paper • 2210.01970 • Published Sep 30, 2022 • 11