End of training
Browse files- README.md +33 -28
- model.safetensors +1 -1
README.md
CHANGED
@@ -18,13 +18,13 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
|
19 |
This model is a fine-tuned version of [AIRI-Institute/gena-lm-bigbird-base-t2t](https://huggingface.co/AIRI-Institute/gena-lm-bigbird-base-t2t) on the None dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
-
- Loss: 0.
|
22 |
-
- F1 Score: 0.
|
23 |
-
- Precision: 0.
|
24 |
-
- Recall: 0.
|
25 |
-
- Accuracy: 0.
|
26 |
-
- Auc: 0.
|
27 |
-
- Prc: 0.
|
28 |
|
29 |
## Model description
|
30 |
|
@@ -54,27 +54,32 @@ The following hyperparameters were used during training:
|
|
54 |
|
55 |
### Training results
|
56 |
|
57 |
-
| Training Loss | Epoch | Step
|
58 |
-
|
59 |
-
| 0.
|
60 |
-
| 0.
|
61 |
-
| 0.
|
62 |
-
| 0.
|
63 |
-
| 0.
|
64 |
-
| 0.
|
65 |
-
| 0.
|
66 |
-
| 0.
|
67 |
-
| 0.
|
68 |
-
| 0.
|
69 |
-
| 0.
|
70 |
-
| 0.
|
71 |
-
| 0.
|
72 |
-
| 0.
|
73 |
-
| 0.
|
74 |
-
| 0.
|
75 |
-
| 0.
|
76 |
-
| 0.
|
77 |
-
| 0.
|
|
|
|
|
|
|
|
|
|
|
78 |
|
79 |
|
80 |
### Framework versions
|
|
|
18 |
|
19 |
This model is a fine-tuned version of [AIRI-Institute/gena-lm-bigbird-base-t2t](https://huggingface.co/AIRI-Institute/gena-lm-bigbird-base-t2t) on the None dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
+
- Loss: 0.4554
|
22 |
+
- F1 Score: 0.8794
|
23 |
+
- Precision: 0.8386
|
24 |
+
- Recall: 0.9243
|
25 |
+
- Accuracy: 0.8677
|
26 |
+
- Auc: 0.9401
|
27 |
+
- Prc: 0.9344
|
28 |
|
29 |
## Model description
|
30 |
|
|
|
54 |
|
55 |
### Training results
|
56 |
|
57 |
+
| Training Loss | Epoch | Step | Validation Loss | F1 Score | Precision | Recall | Accuracy | Auc | Prc |
|
58 |
+
|:-------------:|:------:|:-----:|:---------------:|:--------:|:---------:|:------:|:--------:|:------:|:------:|
|
59 |
+
| 0.5309 | 0.0840 | 500 | 0.4610 | 0.8214 | 0.7592 | 0.8947 | 0.7969 | 0.8757 | 0.8625 |
|
60 |
+
| 0.4688 | 0.1681 | 1000 | 0.4717 | 0.8298 | 0.7467 | 0.9336 | 0.8001 | 0.8862 | 0.8753 |
|
61 |
+
| 0.4569 | 0.2521 | 1500 | 0.4326 | 0.8304 | 0.7461 | 0.9362 | 0.8005 | 0.8889 | 0.8752 |
|
62 |
+
| 0.4361 | 0.3361 | 2000 | 0.4337 | 0.8186 | 0.8527 | 0.7870 | 0.8180 | 0.9035 | 0.8988 |
|
63 |
+
| 0.4222 | 0.4202 | 2500 | 0.4968 | 0.8434 | 0.7628 | 0.9430 | 0.8173 | 0.9095 | 0.8979 |
|
64 |
+
| 0.4233 | 0.5042 | 3000 | 0.3891 | 0.8396 | 0.8674 | 0.8135 | 0.8378 | 0.9207 | 0.9177 |
|
65 |
+
| 0.4031 | 0.5882 | 3500 | 0.3743 | 0.8564 | 0.8687 | 0.8444 | 0.8522 | 0.9262 | 0.9231 |
|
66 |
+
| 0.3739 | 0.6723 | 4000 | 0.3970 | 0.8520 | 0.8662 | 0.8383 | 0.8480 | 0.9275 | 0.9269 |
|
67 |
+
| 0.3891 | 0.7563 | 4500 | 0.4361 | 0.7852 | 0.9181 | 0.6859 | 0.8042 | 0.9277 | 0.9273 |
|
68 |
+
| 0.3856 | 0.8403 | 5000 | 0.3882 | 0.8518 | 0.8904 | 0.8164 | 0.8517 | 0.9309 | 0.9306 |
|
69 |
+
| 0.3926 | 0.9244 | 5500 | 0.3291 | 0.8693 | 0.8600 | 0.8789 | 0.8622 | 0.9328 | 0.9320 |
|
70 |
+
| 0.3737 | 1.0084 | 6000 | 0.3546 | 0.8571 | 0.8783 | 0.8370 | 0.8544 | 0.9331 | 0.9329 |
|
71 |
+
| 0.346 | 1.0924 | 6500 | 0.4352 | 0.8719 | 0.8378 | 0.9088 | 0.8606 | 0.9345 | 0.9317 |
|
72 |
+
| 0.3355 | 1.1765 | 7000 | 0.3880 | 0.8665 | 0.8560 | 0.8773 | 0.8590 | 0.9362 | 0.9350 |
|
73 |
+
| 0.3452 | 1.2605 | 7500 | 0.3991 | 0.8737 | 0.8279 | 0.9249 | 0.8605 | 0.9376 | 0.9368 |
|
74 |
+
| 0.3618 | 1.3445 | 8000 | 0.3564 | 0.8645 | 0.8804 | 0.8492 | 0.8612 | 0.9381 | 0.9382 |
|
75 |
+
| 0.3335 | 1.4286 | 8500 | 0.4719 | 0.8376 | 0.9110 | 0.7751 | 0.8432 | 0.9381 | 0.9381 |
|
76 |
+
| 0.3671 | 1.5126 | 9000 | 0.3808 | 0.8748 | 0.8607 | 0.8895 | 0.8672 | 0.9404 | 0.9405 |
|
77 |
+
| 0.3505 | 1.5966 | 9500 | 0.4061 | 0.8801 | 0.8690 | 0.8914 | 0.8733 | 0.9403 | 0.9405 |
|
78 |
+
| 0.3602 | 1.6807 | 10000 | 0.4968 | 0.8485 | 0.9112 | 0.7938 | 0.8521 | 0.9409 | 0.9412 |
|
79 |
+
| 0.348 | 1.7647 | 10500 | 0.4114 | 0.8781 | 0.8319 | 0.9298 | 0.8654 | 0.9403 | 0.9369 |
|
80 |
+
| 0.3435 | 1.8487 | 11000 | 0.3816 | 0.8745 | 0.8828 | 0.8663 | 0.8702 | 0.9415 | 0.9399 |
|
81 |
+
| 0.3701 | 1.9328 | 11500 | 0.3733 | 0.8641 | 0.8865 | 0.8428 | 0.8617 | 0.9395 | 0.9369 |
|
82 |
+
| 0.3281 | 2.0168 | 12000 | 0.4554 | 0.8794 | 0.8386 | 0.9243 | 0.8677 | 0.9401 | 0.9344 |
|
83 |
|
84 |
|
85 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 455871688
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dc29a07a5e440f2dd5926c46d9c9de90442b35a121f36a4ed0160c01c095f092
|
3 |
size 455871688
|