What is the humaneval? and humaneval plus scores?

by rombodawg - opened Nov 1, 2023

Discussion

rombodawg

Nov 1, 2023

You say its trained on code but you havnt posted any coding benchmarks, or even compared it to codellama

neval

Nov 2, 2023

How to download the models?

weitianwen

Skywork org Nov 2, 2023

You say its trained on code but you havnt posted any coding benchmarks, or even compared it to codellama

We did not focus on its coding capability. At 2T checkpoint, we did benchmark on Humaneval and MBPP. It scored 17.7 on human eval and 26 on MBPP, respectively.
No further benchmark on code was carried out there after.

weitianwen changed discussion status to closed Nov 2, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment