summervent committed
Commit 0563629 • 1 Parent(s): e03ab1a
update model card README.md
README.md CHANGED
@@ -15,12 +15,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [sberbank-ai/ruT5-base](https://huggingface.co/sberbank-ai/ruT5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Rouge1: 28.
-- Rouge2:
-- Rougel:
-- Rougelsum: 28.
-- Gen Len:
+- Loss: 0.1580
+- Rouge1: 28.0246
+- Rouge2: 14.6131
+- Rougel: 28.0357
+- Rougelsum: 28.1585
+- Gen Len: 41.6429
 
 ## Model description
 
@@ -52,33 +52,33 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| 1.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
+| 1.0444 | 0.04 | 500 | 0.5084 | 23.8669 | 11.0011 | 23.8074 | 23.9137 | 48.2589 |
+| 0.9248 | 0.07 | 1000 | 0.3787 | 27.1237 | 13.047 | 27.128 | 27.286 | 44.6518 |
+| 0.6515 | 0.11 | 1500 | 0.3261 | 26.7152 | 12.8472 | 26.7857 | 27.0912 | 44.6429 |
+| 0.5533 | 0.14 | 2000 | 0.2929 | 27.5391 | 13.497 | 27.5056 | 27.7679 | 42.2232 |
+| 0.5962 | 0.18 | 2500 | 0.2678 | 27.2948 | 13.5317 | 27.3026 | 27.4828 | 42.7411 |
+| 0.5667 | 0.22 | 3000 | 0.2549 | 27.4315 | 13.7922 | 27.4858 | 27.6141 | 42.5 |
+| 0.4532 | 0.25 | 3500 | 0.2460 | 28.0246 | 14.6131 | 28.0357 | 28.1585 | 42.3929 |
+| 0.4552 | 0.29 | 4000 | 0.2320 | 27.5 | 13.9435 | 27.567 | 27.6953 | 42.1161 |
+| 0.4245 | 0.32 | 4500 | 0.2264 | 27.3214 | 13.3333 | 27.3884 | 27.5502 | 42.1607 |
+| 0.4109 | 0.36 | 5000 | 0.2182 | 27.5 | 13.9435 | 27.567 | 27.6953 | 42.2411 |
+| 0.3826 | 0.4 | 5500 | 0.2115 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.9196 |
+| 0.4112 | 0.43 | 6000 | 0.2066 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.9018 |
+| 0.4006 | 0.47 | 6500 | 0.1980 | 28.0246 | 14.6131 | 28.0357 | 28.1585 | 41.8304 |
+| 0.3494 | 0.5 | 7000 | 0.1944 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.9554 |
+| 0.3225 | 0.54 | 7500 | 0.1928 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.8929 |
+| 0.3525 | 0.58 | 8000 | 0.1869 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.6071 |
+| 0.3828 | 0.61 | 8500 | 0.1821 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.5446 |
+| 0.3287 | 0.65 | 9000 | 0.1781 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.5714 |
+| 0.3276 | 0.68 | 9500 | 0.1778 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.6786 |
+| 0.3054 | 0.72 | 10000 | 0.1727 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.4375 |
+| 0.3685 | 0.76 | 10500 | 0.1728 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.6964 |
+| 0.3454 | 0.79 | 11000 | 0.1700 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.75 |
+| 0.3056 | 0.83 | 11500 | 0.1641 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.7143 |
+| 0.3399 | 0.86 | 12000 | 0.1606 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.7143 |
+| 0.3079 | 0.9 | 12500 | 0.1600 | 27.5 | 13.9435 | 27.567 | 27.6953 | 41.6429 |
+| 0.2646 | 0.94 | 13000 | 0.1591 | 28.0246 | 14.6131 | 28.0357 | 28.1585 | 41.5446 |
+| 0.2297 | 0.97 | 13500 | 0.1580 | 28.0246 | 14.6131 | 28.0357 | 28.1585 | 41.6429 |
 
 
 ### Framework versions
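
The evaluation-set figures added in this commit (Loss 0.1580, Rouge1 28.0246, Gen Len 41.6429) match the last row of the training-results table, step 13500. The sketch below is one way to check that correspondence; it is not part of the model card. It assumes a handful of rows copied verbatim from the table, and the names `TABLE`, `COLUMNS`, and `parse_rows` are hypothetical, chosen for illustration. The steps-per-epoch figure is only a rough estimate, because the Epoch column is rounded to two decimals.

```python
# A few rows copied from the training-results table (not the full table).
TABLE = """\
| 1.0444 | 0.04 | 500 | 0.5084 | 23.8669 | 11.0011 | 23.8074 | 23.9137 | 48.2589 |
| 0.4532 | 0.25 | 3500 | 0.2460 | 28.0246 | 14.6131 | 28.0357 | 28.1585 | 42.3929 |
| 0.2646 | 0.94 | 13000 | 0.1591 | 28.0246 | 14.6131 | 28.0357 | 28.1585 | 41.5446 |
| 0.2297 | 0.97 | 13500 | 0.1580 | 28.0246 | 14.6131 | 28.0357 | 28.1585 | 41.6429 |
"""

# Hypothetical short names for the table's nine columns, in order.
COLUMNS = [
    "train_loss", "epoch", "step", "val_loss",
    "rouge1", "rouge2", "rougel", "rougelsum", "gen_len",
]


def parse_rows(table: str) -> list[dict[str, float]]:
    """Turn pipe-delimited markdown rows into dicts of floats."""
    rows = []
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        rows.append(dict(zip(COLUMNS, map(float, cells))))
    return rows


rows = parse_rows(TABLE)

# Checkpoint with the lowest validation loss among the copied rows.
best = min(rows, key=lambda row: row["val_loss"])

# Rough estimate only: the Epoch column is rounded to two decimals.
steps_per_epoch = best["step"] / best["epoch"]

print(f"best step: {int(best['step'])}, val_loss: {best['val_loss']:.4f}")
print(f"approx. steps per epoch: {steps_per_epoch:.0f}")
```

With the full table pasted into `TABLE`, the same `min` call picks the row to compare against the summary bullets.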