lilloukas commited on
Commit
f573b6f
1 Parent(s): 0e1a3ca

Update README.md

Browse files

open llm leaderboard update

Files changed (1) hide show
  1. README.md +5 -5
README.md CHANGED
@@ -17,11 +17,11 @@ Platypus-30B is an instruction fine-tuned model based on the LLaMA-30B transform
17
 
18
  | Metric | Value |
19
  |-----------------------|-------|
20
- | MMLU (5-shot) | 64.2 |
21
- | ARC (25-shot) | 64.6 |
22
- | HellaSwag (10-shot) | 84.3 |
23
- | TruthfulQA (0-shot) | 45.8 |
24
- | Avg. | 64.7 |
25
 
26
  We use state-of-the-art [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above.
27
 
 
17
 
18
  | Metric | Value |
19
  |-----------------------|-------|
20
+ | MMLU (5-shot) | 64.1 |
21
+ | ARC (25-shot) | 57.6 |
22
+ | HellaSwag (10-shot) | 81.9 |
23
+ | TruthfulQA (0-shot) | 45.3 |
24
+ | Avg. | 62.2 |
25
 
26
  We use state-of-the-art [Language Model Evaluation Harness](https://github.com/EleutherAI/lm-evaluation-harness) to run the benchmark tests above.
27