Eval results?

by rombodawg - opened Dec 4, 2023

Discussion

rombodawg

Dec 4, 2023

I know this isnt a finished model, but im still curious of the benchmarks. Mainly mmlu and human eval.

jonabur

LumiOpen org Feb 21, 2024

We will be publishing more detailed results soon, but MMLU on the final checkpoint is 46.29 and HumanEval Pass@10 is 37.20. We hope to release an instruction tuned version soon, but are still evaluating open dataset options.

jonabur changed discussion status to closed Feb 21, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment