siddartha-abacus
commited on
Commit
•
5fc7d7b
1
Parent(s):
5320d73
Update README.md
Browse files
README.md
CHANGED
@@ -47,7 +47,7 @@ To do a proper abalation we compared the performance of 4 models trained for ~1
|
|
47 |
Orca, ShareGPT). Here are the results:
|
48 |
|
49 |
| Model | Trainable Params | Train Loss | Eval Loss | GSM8K | TruthfulQA |
|
50 |
-
| :-----| ------: | ---------: |
|
51 |
| Mistral 7B | 0 | - | - | 0.374 | 0.426 |
|
52 |
| Mistral 10B | 0 | - | - | 0.290 | 0.407 |
|
53 |
| Mistral 7B + LoRA r=12 | 31M | 0.412 | 0.366 | 0.514 | 0.499 |
|
|
|
47 |
Orca, ShareGPT). Here are the results:
|
48 |
|
49 |
| Model | Trainable Params | Train Loss | Eval Loss | GSM8K | TruthfulQA |
|
50 |
+
| :-----| ------: | ---------: | -------: | ----: | ---------: |
|
51 |
| Mistral 7B | 0 | - | - | 0.374 | 0.426 |
|
52 |
| Mistral 10B | 0 | - | - | 0.290 | 0.407 |
|
53 |
| Mistral 7B + LoRA r=12 | 31M | 0.412 | 0.366 | 0.514 | 0.499 |
|