codegen_eval nuprl/MultiPL-E Updated Jun 16, 2023 • 64.2k • 32 openai_humaneval Viewer • Updated Jan 4 • 434k • 184 Running 795 📈 Big Code Models Leaderboard Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming Paper • 2402.14261 • Published Feb 22 • 10
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming Paper • 2402.14261 • Published Feb 22 • 10