Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
automated-research-group
's Collections
Models
Eval Datasets
Eval Datasets
updated
Jan 11
Upvote
-
openai_humaneval
Viewer
•
Updated
Jan 4
•
344k
•
176
mbpp
Viewer
•
Updated
Jan 4
•
174k
•
96
piqa
Viewer
•
Updated
Jan 18
•
523k
•
77
lighteval/siqa
Viewer
•
Updated
Oct 7, 2023
•
96.2k
•
3
Rowan/hellaswag
Viewer
•
Updated
Sep 28, 2023
•
10.9k
•
72
winogrande
Viewer
•
Updated
Jan 18
•
335k
•
45
allenai/ai2_arc
Viewer
•
Updated
Dec 21, 2023
•
709k
•
83
allenai/openbookqa
Viewer
•
Updated
Jan 4
•
1.63k
•
60
tau/commonsense_qa
Viewer
•
Updated
Jan 4
•
7.45k
•
51
natural_questions
Viewer
•
Updated
Mar 11
•
1.16k
•
47
mandarjoshi/trivia_qa
Viewer
•
Updated
Jan 5
•
3.11k
•
57
rajpurkar/squad
Viewer
•
Updated
Mar 4
•
4.03k
•
211
quac
Viewer
•
Updated
Jan 18
•
734
•
24
google/boolq
Viewer
•
Updated
Jan 22
•
5.75k
•
53
gsm8k
Viewer
•
Updated
Jan 4
•
386k
•
242
hendrycks/competition_math
Updated
Jun 8, 2023
•
10.3k
•
88
cais/mmlu
Viewer
•
Updated
Mar 8
•
1.68M
•
223
maveriq/bigbenchhard
Viewer
•
Updated
Sep 29, 2023
•
42.3k
•
6
baber/agieval
Updated
Oct 26, 2023
•
3
•
4
Upvote
-
Share collection
View history
Collection guide
Browse collections