clement-cvll
commited on
Commit
•
b8a332f
1
Parent(s):
d50cea9
End of training
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
20 |
|
21 |
This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
-
- Loss: 1.
|
24 |
|
25 |
## Model description
|
26 |
|
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
|
|
52 |
|
53 |
| Training Loss | Epoch | Step | Validation Loss |
|
54 |
|:-------------:|:-----:|:----:|:---------------:|
|
55 |
-
| No log | 1.0 | 3 | 2.
|
56 |
-
| No log | 2.0 | 6 | 2.
|
57 |
-
| No log | 3.0 | 9 | 1.
|
58 |
-
| 2.
|
59 |
-
| 2.
|
60 |
|
61 |
|
62 |
### Framework versions
|
|
|
20 |
|
21 |
This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
|
22 |
It achieves the following results on the evaluation set:
|
23 |
+
- Loss: 1.7939
|
24 |
|
25 |
## Model description
|
26 |
|
|
|
52 |
|
53 |
| Training Loss | Epoch | Step | Validation Loss |
|
54 |
|:-------------:|:-----:|:----:|:---------------:|
|
55 |
+
| No log | 1.0 | 3 | 2.6773 |
|
56 |
+
| No log | 2.0 | 6 | 2.3213 |
|
57 |
+
| No log | 3.0 | 9 | 1.9551 |
|
58 |
+
| 2.5229 | 4.0 | 12 | 1.8233 |
|
59 |
+
| 2.5229 | 5.0 | 15 | 1.7939 |
|
60 |
|
61 |
|
62 |
### Framework versions
|