Eval Datasets - an automated-research-group Collection

automated-research-group's Collections

Eval Datasets

updated Jan 11, 2024

openai/openai_humaneval
Updated Jan 4, 2024 • 164 rows • 100k downloads • 284 likes

google-research-datasets/mbpp
Updated Jan 4, 2024 • 1.4k rows • 95.4k downloads • 166 likes

ybisk/piqa
Updated Jan 18, 2024 • 366k downloads • 89 likes

lighteval/siqa
Updated Oct 7, 2023 • 35.4k rows • 3.71k downloads • 5 likes

Rowan/hellaswag
Updated Sep 28, 2023 • 60k rows • 386k downloads • 111 likes

allenai/winogrande
Updated Jan 18, 2024 • 334k downloads • 61 likes

allenai/ai2_arc
Updated Dec 21, 2023 • 7.79k rows • 392k downloads • 174 likes

allenai/openbookqa
Updated Jan 4, 2024 • 11.9k rows • 302k downloads • 88 likes

tau/commonsense_qa
Updated Jan 4, 2024 • 12.1k rows • 280k downloads • 91 likes

google-research-datasets/natural_questions
Updated Mar 11, 2024 • 26.3k rows • 7.55k downloads • 95 likes

mandarjoshi/trivia_qa
Updated Jan 5, 2024 • 848k rows • 58.3k downloads • 122 likes

rajpurkar/squad
Updated Mar 4, 2024 • 98.2k rows • 62.2k downloads • 294 likes

allenai/quac
Updated Jan 18, 2024 • 501 downloads • 30 likes

google/boolq
Updated Jan 22, 2024 • 12.7k rows • 9.95k downloads • 73 likes

openai/gsm8k
Updated Jan 4, 2024 • 17.6k rows • 358k downloads • 631 likes

hendrycks/competition_math
Updated Jun 8, 2023 • 141

cais/mmlu
Updated Mar 8, 2024 • 231k rows • 155k downloads • 412 likes

maveriq/bigbenchhard
Updated Sep 29, 2023 • 6.51k rows • 2.34k downloads • 25 likes

baber/agieval
Updated Oct 26, 2023 • 508 downloads • 5 likes
