Update README.md
README.md
CHANGED
@@ -79,6 +79,14 @@ The following hyperparameters were used during training:
 | 0.2633 | 2.0 | 5000 | 0.4007 |
 | 0.1205 | 3.0 | 7500 | 0.4703 |
 
+## Evaluation Results
+The model was evaluated on an undisclosed dataset using a language modeling task. The evaluation results after 3 epochs of fine-tuning are as follows:
+
+- Evaluation Loss: 0.3954
+- Evaluation Runtime: 51.60 seconds
+- Average Samples per Second: 96.89
+- Average Steps per Second: 6.06
+- Epoch: 3.0
 
 ### Framework versions
 
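The evaluation figures added in this change can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the commit. It assumes the reported loss is a mean per-token cross-entropy in nats, so that perplexity is its exponential, and that the throughput numbers are averages over the whole evaluation run. The variable names are hypothetical; only the four numeric values come from the README.

```python
import math

# Values copied from the "Evaluation Results" section of the README.
eval_loss = 0.3954            # assumed: mean cross-entropy per token, in nats
eval_runtime_s = 51.60        # seconds
samples_per_second = 96.89
steps_per_second = 6.06

# Perplexity is exp(loss) under the cross-entropy assumption above.
perplexity = math.exp(eval_loss)

# Throughput multiplied by runtime estimates how much data was evaluated.
approx_samples = eval_runtime_s * samples_per_second
approx_steps = eval_runtime_s * steps_per_second

# Samples per step approximates the effective evaluation batch size.
approx_batch_size = samples_per_second / steps_per_second

print(f"perplexity        ~ {perplexity:.3f}")
print(f"eval samples      ~ {approx_samples:.0f}")
print(f"eval steps        ~ {approx_steps:.0f}")
print(f"samples per step  ~ {approx_batch_size:.1f}")
```

Under these assumptions the perplexity comes out near 1.48, and the throughput figures imply roughly 5,000 evaluation samples at an effective batch size of about 16. Because the dataset is undisclosed, these are consistency checks on the reported numbers rather than confirmed properties of the evaluation setup.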