Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements Paper • 2210.01970 • Published Sep 30, 2022 • 11
RAFT: A Real-World Few-Shot Text Classification Benchmark Paper • 2109.14076 • Published Sep 28, 2021 • 2
Datasets: A Community Library for Natural Language Processing Paper • 2109.02846 • Published Sep 7, 2021 • 10