nvidia
/

Minitron-8B-Base

Model card Files Files and versions Community

srvm commited on Jul 23

Commit

019a9d0

•

1 Parent(s): 66505a7

Add evaluation preview

Files changed (1) hide show

README.md +23 -0

README.md CHANGED Viewed

@@ -53,6 +53,29 @@ print(output_text)
 Minitron is released under the [NVIDIA Open Model License Agreement](https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf).
 ## Citation
 If you find our work helpful, please consider citing our paper:

 Minitron is released under the [NVIDIA Open Model License Agreement](https://developer.download.nvidia.com/licenses/nvidia-open-model-license-agreement-june-2024.pdf).
+## Evaluation Results
+*5-shot performance.* Language Understanding evaluated using [Massive Multitask Language Understanding](https://arxiv.org/abs/2009.03300):
+| Average |
+| :---- |
+| 63.8 |
+*Zero-shot performance.* Evaluated using select datasets from the [LM Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) with additions:
+HellaSwag | Winogrande | GSM8K| ARC-C | XLSum |
+| :------------- | :------------- | :------------- | :------------- | :------------- |
+| 80.7 | 79.0 | 51.3  | 52.6 | 31.2
+*Code generation performance*. Evaluated using [HumanEval](https://github.com/openai/human-eval):
+| p@1, 0-Shot |
+| :------------- |
+| 31.6 |
+Please refer to our [paper](https://arxiv.org/abs/2407.14679) for the full set of results.
 ## Citation
 If you find our work helpful, please consider citing our paper: