Update README.md
README.md
CHANGED
@@ -13,7 +13,7 @@ Unreleased, untested, unfinished beta.
 
 # Evaluations
 
-We've only done very limited testing as yet. The epoch 4.5 checkpoint scores above 5 on MT-Bench (better than Alpaca-13B, worse than Llama2-7b-chat), while preliminary benchmarks suggest peak average performance was achieved roughly at epoch 4.
+We've only done very limited testing as yet. The [epoch 4.5 checkpoint](https://huggingface.co/Open-Orca/oo-phi-1_5/commit/aa05eb2596d6d11951695d2e327616188d768880) scores above 5 on MT-Bench (better than Alpaca-13B, worse than Llama2-7b-chat), while preliminary benchmarks suggest peak average performance was achieved roughly at epoch 4.
 
 MT-bench Epoch 4.5 result:
 ```