GeorgiaTechResearchInstitute
/

galactica-30b-evol-instruct-70k

Text Generation

text-generation-inference

Model card Files Files and versions Community

blair-johnson commited on Jun 27, 2023

Commit

e9839ea

·

1 Parent(s): 409bd1a

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -107,6 +107,15 @@ GALACTICA 30B Evol-Instruct was fine-tuned in 196 hours using 16 A100 80GB GPUs,
 ## Performance and Limitations
 Qualitative evaluation suggests that the evol-instruct-70k fine-tuned Galactica models are signficantly more controllable and attentive to user prompts than the Alpaca fine-tuned GALPACA models.
 ## Works Cited

 ## Performance and Limitations
+Common benchmark scores generated using the [Eleuther AI LLM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness/tree/master).
+| Task | Version | Metric | Value | Stderr |
+|------|---------|--------|-------|--------|
+| arc_challenge 25-shot | 0 | acc | 0.4684 | 0.146 |
+|                       |   | acc_norm | 0.4787 | 0.146 |
+|hellaswag 10-shot| 0 | acc | 0.4705 | 0.0050 |
+|                 |   | acc_norm | 0.6111 | 0.0049 |
 Qualitative evaluation suggests that the evol-instruct-70k fine-tuned Galactica models are signficantly more controllable and attentive to user prompts than the Alpaca fine-tuned GALPACA models.
 ## Works Cited