Update README.md
README.md
CHANGED
@@ -50,13 +50,16 @@ model.save_quantized(quantized_model_dir)
## Evaluation

### Open LLM Leaderboard evaluation scores

Model evaluation results were obtained via [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness).

| Benchmark | Meta-Llama-3-70B-Instruct | Meta-Llama-3-70B-Instruct-FP8 | Meta-Llama-3-70B-Instruct-FP8-KV<br>(this model) |
| :-------------------------------------------------------: | :-----------------------: | :---------------------------: | :----------------------------------------------: |
| [ARC-c](https://arxiv.org/abs/1911.01547)<br>25-shot | 72.69 | 72.61 | 72.57 |
| [HellaSwag](https://arxiv.org/abs/1905.07830)<br>10-shot | 85.50 | 85.41 | 85.32 |
| [MMLU](https://arxiv.org/abs/2009.03300)<br>5-shot | 80.18 | 80.06 | 79.69 |
| [TruthfulQA](https://arxiv.org/abs/2109.07958)<br>0-shot | 62.90 | 62.73 | 61.92 |
| [WinoGrande](https://arxiv.org/abs/1907.10641)<br>5-shot | 83.34 | 83.03 | 83.66 |
| [GSM8K](https://arxiv.org/abs/2110.14168)<br>5-shot | 92.49 | 91.12 | 90.83 |
| **Average<br>Accuracy** | **79.51** | **79.16** | **79.00** |
| **Recovery** | **100%** | **99.55%** | **99.36%** |
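As a rough guide to reproducing a row of this table, the sketch below scores one benchmark through lm-evaluation-harness's Python API (`lm_eval.simple_evaluate`). The backend choice, checkpoint path, and batch size are assumptions, not the exact configuration used to produce the numbers above.

```python
# Sketch: reproduce the ARC-c (25-shot) row with lm-evaluation-harness.
# Assumptions: the FP8 checkpoint is run through vLLM (FP8 weights generally
# need a backend with FP8 support), and the pretrained path is illustrative.
import lm_eval

results = lm_eval.simple_evaluate(
    model="vllm",
    model_args="pretrained=Meta-Llama-3-70B-Instruct-FP8-KV",  # assumed local path or repo id
    tasks=["arc_challenge"],  # the other rows correspond to hellaswag, mmlu, truthfulqa, winogrande, gsm8k
    num_fewshot=25,           # 25-shot, matching the table
    batch_size=8,             # illustrative; tune to available GPU memory
)

print(results["results"]["arc_challenge"])
```

Recovery appears to be the quantized model's average accuracy relative to the unquantized baseline (e.g. 79.00 / 79.51 ≈ 99.36% for this model); small differences against the reported figures can come from rounding of the per-task scores.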