Evaluation datasets
community
AI & ML interests
None defined yet.
Team members
4
models
None public yet
datasets
62
lighteval/aimo_progress_prize_1 • Updated 17 days ago • 6
lighteval/mt-bench • Updated Mar 19 • 162 • 1
lighteval/bbh • Updated Jan 31 • 25.7k • 1
lighteval/big_bench_hard • Updated Oct 17, 2023 • 1 • 2
lighteval/MATH • Updated Oct 17, 2023 • 2.59k • 17
lighteval/natural_questions_clean • Updated Oct 17, 2023 • 27
lighteval/agi_eval_en • Updated Oct 17, 2023 • 2 • 1
lighteval/siqa • Updated Oct 7, 2023 • 35.8k • 3
lighteval/trivia_qa • Updated Oct 7, 2023 • 35
lighteval/mutual_harness • Updated Aug 9, 2023 • 1