Hugging Face
walterShen's Collections
Code LMs Benchmark
Updated Mar 6
Big Code Models Leaderboard • Running • 771
Can Ai Code Results • Running • 328
openai_humaneval • Updated Jan 4 • 416k • 179
mbpp • Updated Jan 4 • 194k • 96
nuprl/MultiPL-E • Updated Jun 16, 2023 • 75.2k • 31
evalplus/mbppplus • Updated Apr 17 • 6.8k
BAAI/TACO • Updated Jan 15 • 555 • 55
princeton-nlp/SWE-bench • Updated Apr 15 • 30.5k • 54
codeparrot/apps • Updated Oct 20, 2022 • 26.1k • 99
cruxeval-org/cruxeval • Updated Jan 23 • 3.46k • 9
tianyang/repobench_python_v1.1 • Updated Feb 27 • 69 • 4
SciPhi/textbooks-are-all-you-need-lite • Updated Sep 30, 2023 • 94 • 166
nampdn-ai/tiny-codes • Updated Sep 30, 2023 • 575 • 195
math_qa • Updated Jan 18 • 32.3k • 68
deepmind/code_contests • Updated Jun 11, 2023 • 4.04k • 87
FudanSELab/ClassEval • Updated Jan 8 • 173 • 6
ML4SE2023-G1-WizardCoder/ML4SE23_G1_MBCPP-SCoT • Updated Oct 25, 2023 • 2
Muennighoff/quixbugs • Updated Mar 26, 2023 • 1
bigcode/humanevalpack • Updated 18 days ago • 835k • 53
NTU-NLP-sg/xCodeEval • Updated Jan 2 • 78 • 29
JetBrains-Research/commit-chronicle • Updated Oct 5, 2023 • 87 • 4
tianyang/repobench_java_v1.1 • Updated Feb 27 • 6
zijwang/CrossCodeEval • Updated Oct 19, 2023