hamishivi commited on
Commit
829015a
1 Parent(s): c0631a3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -46,7 +46,7 @@ Here is the performance of this model across benchmarks explored in our paper [H
46
 
47
  | MMLU 0-shot | MMLU 5-shot | GSM Direct | GSM CoT | BBH Direct | BBH CoT | TydiQA Gold-Passage | TydiQA Closed-book | Codex-Eval Pass@1 | Codex-Eval Pass@10 | AlpacaFarm vs Davinci-003 | Average |
48
  |:-----------:|:-----------:|:----------:|:-------:|:----------:|:-------:|:-------------------:|:------------------:|:-----------------:|:------------------:|:-------------------------:|---------|
49
- | 0.380 | 0.358 | 0.050 | 0.070 | 0.272 | 0.244 | 43.569 | 8.718 | 0.111 | 0.221 | 12.67 | 20.7 |
50
 
51
 
52
  If you use this model, please cite our work, the llama paper, and the original dataset:
 
46
 
47
  | MMLU 0-shot | MMLU 5-shot | GSM Direct | GSM CoT | BBH Direct | BBH CoT | TydiQA Gold-Passage | TydiQA Closed-book | Codex-Eval Pass@1 | Codex-Eval Pass@10 | AlpacaFarm vs Davinci-003 | Average |
48
  |:-----------:|:-----------:|:----------:|:-------:|:----------:|:-------:|:-------------------:|:------------------:|:-----------------:|:------------------:|:-------------------------:|---------|
49
+ | 38.0 | 35.8 | 5.0 | 7.0 | 27.2 | 24.4 | 43.6 | 8.7 | 11.1 | 22.1 | 12.7 | 20.7 |
50
 
51
 
52
  If you use this model, please cite our work, the llama paper, and the original dataset: