Update README.md
README.md
CHANGED
@@ -44,6 +44,18 @@ We use the prompt from [Stanford Alpaca](https://github.com/tatsu-lab/stanford_a
 | --- | ---: | ---: | ---: | ---: | ---: |
 | Model (13B) | 128 | 1e-5 | 5 | 2048 | 0 |

+## Performance
+
+Compared to the llama2-chat, our models can have better average performance.<br>
+
+| | Average | ARC | HellaSwag | MMLU | TruthfulQA | Alpaca_Eval | Avg Length |
+|---|---|---|---|---|---|---|---|
+| Llama-2-7b-chat | 56.335 | 52.9 | 78.55 | 48.32 | 45.57 | 71.37 | 1479 |
+| Llama-2-13b-chat | 59.935 | 59.04| 81.94 | 54.64 | 44.12 | 81.09 | 1513 |
+|||||||||
+| claude_alpaca-7b | 57.78 | 56.66 | 81.17 | 46.58 | 46.71 | 71.23 | 1066 |
+| claude_alpaca-13b | 61.29 | 61.18 | 84.08 | 55.74 | 44.18 | 78.93 | 1127 |
+

 ## Citation

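
The added table does not say how the Average column is computed. The listed values are consistent with an unweighted mean of the four benchmark scores (ARC, HellaSwag, MMLU, TruthfulQA), leaving out Alpaca_Eval and Avg Length. That reading is an inference from the numbers, not something the README states. A minimal Python sketch to check it, with the helper name `benchmark_mean` chosen here purely for illustration:

```python
# Scores copied from the Performance table, in the order
# (ARC, HellaSwag, MMLU, TruthfulQA). Alpaca_Eval and Avg Length are
# deliberately excluded: the assumption being tested is that "Average"
# covers only these four benchmarks.
scores = {
    "Llama-2-7b-chat": (52.9, 78.55, 48.32, 45.57),
    "Llama-2-13b-chat": (59.04, 81.94, 54.64, 44.12),
    "claude_alpaca-7b": (56.66, 81.17, 46.58, 46.71),
    "claude_alpaca-13b": (61.18, 84.08, 55.74, 44.18),
}

# "Average" column as reported in the table.
reported = {
    "Llama-2-7b-chat": 56.335,
    "Llama-2-13b-chat": 59.935,
    "claude_alpaca-7b": 57.78,
    "claude_alpaca-13b": 61.29,
}


def benchmark_mean(values):
    """Unweighted arithmetic mean of the benchmark scores (hypothetical helper)."""
    return sum(values) / len(values)


for model, values in scores.items():
    mean = benchmark_mean(values)
    print(f"{model}: computed {mean:.3f}, reported {reported[model]}")
```

Under this assumption, three of the four rows reproduce the reported Average, and claude_alpaca-13b computes to about 61.295 against a reported 61.29, which looks like a rounding or truncation difference.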