Correct HumanEval scores

#79
by Muennighoff - opened
BigScience Workshop org
edited Aug 14, 2022

Previous scores did not strip end of sequence tokens. The updated scores ignore the end of sequence tokens (</s>).
This is equivalent to evaluating on code generations decoded with tokenizer.decode(code_tokens, skip_special_tokens=True)

cc @TimeRobber @loubnabnl

Muennighoff changed pull request title from Update README.md to Correct HumanEval scores
Muennighoff changed pull request status to merged

Sign up or log in to comment