End of training
Browse files
README.md
CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
21 |
|
22 |
This model is a fine-tuned version of [bayartsogt/mongolian-roberta-base](https://huggingface.co/bayartsogt/mongolian-roberta-base) on the None dataset.
|
23 |
It achieves the following results on the evaluation set:
|
24 |
-
- Loss: 0.
|
25 |
-
- Precision: 0.
|
26 |
-
- Recall: 0.
|
27 |
-
- F1: 0.
|
28 |
-
- Accuracy: 0.
|
29 |
|
30 |
## Model description
|
31 |
|
@@ -50,15 +50,17 @@ The following hyperparameters were used during training:
|
|
50 |
- seed: 42
|
51 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
52 |
- lr_scheduler_type: linear
|
53 |
-
- num_epochs:
|
54 |
|
55 |
### Training results
|
56 |
|
57 |
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
|
58 |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.
|
|
|
|
|
62 |
|
63 |
|
64 |
### Framework versions
|
|
|
21 |
|
22 |
This model is a fine-tuned version of [bayartsogt/mongolian-roberta-base](https://huggingface.co/bayartsogt/mongolian-roberta-base) on the None dataset.
|
23 |
It achieves the following results on the evaluation set:
|
24 |
+
- Loss: 0.0790
|
25 |
+
- Precision: 0.9189
|
26 |
+
- Recall: 0.9316
|
27 |
+
- F1: 0.9252
|
28 |
+
- Accuracy: 0.9794
|
29 |
|
30 |
## Model description
|
31 |
|
|
|
50 |
- seed: 42
|
51 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
52 |
- lr_scheduler_type: linear
|
53 |
+
- num_epochs: 5
|
54 |
|
55 |
### Training results
|
56 |
|
57 |
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
|
58 |
|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
|
59 |
+
| 0.3764 | 1.0 | 60 | 0.1122 | 0.7718 | 0.8424 | 0.8055 | 0.9604 |
|
60 |
+
| 0.1021 | 2.0 | 120 | 0.0908 | 0.8087 | 0.8760 | 0.8410 | 0.9679 |
|
61 |
+
| 0.0698 | 3.0 | 180 | 0.0862 | 0.8342 | 0.8861 | 0.8594 | 0.9703 |
|
62 |
+
| 0.0476 | 4.0 | 240 | 0.0779 | 0.9056 | 0.9242 | 0.9148 | 0.9784 |
|
63 |
+
| 0.0337 | 5.0 | 300 | 0.0790 | 0.9189 | 0.9316 | 0.9252 | 0.9794 |
|
64 |
|
65 |
|
66 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 496259468
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eeb04b3f86e2c4ff1a7b747d2d8a1130a05da82d701239d007b17a60154e82e4
|
3 |
size 496259468
|
runs/May26_21-15-16_debf144f0859/events.out.tfevents.1716758117.debf144f0859.34.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b42c71ceb6cd7b71f60d1a77f3b4fa6d178625cbf73431ae84af765de6058e70
|
3 |
+
size 8660
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4920
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0c8cfedf281e59b503c0f3f8759513d7120c28a3810362ac28185fe2a974ac8c
|
3 |
size 4920
|