title: BigCodeBench Evaluator | |
emoji: 🥇 | |
colorFrom: green | |
colorTo: indigo | |
sdk: docker | |
app_file: app.py | |
disable_embedding: true | |
pinned: false | |
license: apache-2.0 | |
tags: | |
- leaderboard | |
- eval:code | |
- test:public | |
- judge:auto | |
Paper:arxiv.org/abs/2406.15877 |