allenai
/

open-instruct-dolly-7b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

hamishivi commited on Jun 8, 2023

Commit

829015a

•

1 Parent(s): c0631a3

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -46,7 +46,7 @@ Here is the performance of this model across benchmarks explored in our paper [H
 | MMLU 0-shot | MMLU 5-shot | GSM Direct | GSM CoT | BBH Direct | BBH CoT | TydiQA Gold-Passage | TydiQA Closed-book | Codex-Eval Pass@1 | Codex-Eval Pass@10 | AlpacaFarm vs Davinci-003 | Average |
 |:-----------:|:-----------:|:----------:|:-------:|:----------:|:-------:|:-------------------:|:------------------:|:-----------------:|:------------------:|:-------------------------:|---------|
-|    0.380    |    0.358    |    0.050   |  0.070  |    0.272   |  0.244  |        43.569       |        8.718       |       0.111       |        0.221       |           12.67           | 20.7    |
 If you use this model, please cite our work, the llama paper, and the original dataset:

 | MMLU 0-shot | MMLU 5-shot | GSM Direct | GSM CoT | BBH Direct | BBH CoT | TydiQA Gold-Passage | TydiQA Closed-book | Codex-Eval Pass@1 | Codex-Eval Pass@10 | AlpacaFarm vs Davinci-003 | Average |
 |:-----------:|:-----------:|:----------:|:-------:|:----------:|:-------:|:-------------------:|:------------------:|:-----------------:|:------------------:|:-------------------------:|---------|
+|    38.0    |    35.8    |    5.0   |  7.0  |    27.2   |  24.4  |        43.6       |        8.7       |       11.1       |        22.1       |           12.7           | 20.7    |
 If you use this model, please cite our work, the llama paper, and the original dataset: