Model save
- README.md +14 -14
- model.safetensors +1 -1
README.md
CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
-- Bleu: 23.
-- Gen Len: 27.
+- Loss: 1.7607
+- Bleu: 23.421
+- Gen Len: 27.6243
 
 ## Model description
 
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
 - seed: 42
 - gradient_accumulation_steps: 2
 - total_train_batch_size: 16
-- optimizer:
+- optimizer: Adafactor
 - lr_scheduler_type: linear
 - training_steps: 100000
 
@@ -52,16 +52,16 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
 |:-------------:|:------:|:------:|:---------------:|:-------:|:-------:|
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
-| 1.
+| 1.7882 | 0.2778 | 10000 | 1.9278 | 19.7853 | 28.4147 |
+| 1.6619 | 0.5556 | 20000 | 1.8710 | 21.3803 | 27.667 |
+| 1.6007 | 0.8333 | 30000 | 1.8397 | 22.2715 | 27.317 |
+| 1.5269 | 1.1111 | 40000 | 1.8205 | 21.9329 | 27.704 |
+| 1.498 | 1.3889 | 50000 | 1.8134 | 22.4836 | 27.63 |
+| 1.4801 | 1.6667 | 60000 | 1.7941 | 22.727 | 27.582 |
+| 1.462 | 1.9444 | 70000 | 1.7766 | 23.0372 | 27.5903 |
+| 1.4182 | 2.2222 | 80000 | 1.7724 | 23.6231 | 27.4233 |
+| 1.4079 | 2.5 | 90000 | 1.7663 | 23.2604 | 27.7623 |
+| 1.4037 | 2.7778 | 100000 | 1.7607 | 23.421 | 27.6243 |
 
 
 ### Framework versions
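Note on the README numbers: the hyperparameters and the results table can be cross-checked with a little arithmetic. The sketch below uses only values from this diff. The function names are illustrative, and `num_devices=1` is an assumption because the device count is not part of the diff. It derives the per-device batch size implied by `total_train_batch_size` and `gradient_accumulation_steps`, and estimates the number of optimizer steps per epoch from the final table row.

```python
# Values copied from the README diff; everything else is derived.
TOTAL_TRAIN_BATCH_SIZE = 16
GRADIENT_ACCUMULATION_STEPS = 2
FINAL_STEP = 100_000
FINAL_EPOCH = 2.7778  # rounded to four decimals in the table


def micro_batch_size(total_batch: int, accumulation_steps: int, num_devices: int = 1) -> int:
    """Per-device batch size implied by the total batch size.

    num_devices=1 is an assumption; the diff does not state the device count.
    """
    size, remainder = divmod(total_batch, accumulation_steps * num_devices)
    if remainder:
        raise ValueError("total batch size is not divisible by accumulation_steps * num_devices")
    return size


def steps_per_epoch(step: int, epoch: float) -> float:
    """Estimate optimizer steps per epoch from one (step, epoch) pair."""
    return step / epoch


per_device = micro_batch_size(TOTAL_TRAIN_BATCH_SIZE, GRADIENT_ACCUMULATION_STEPS)
estimated_steps = steps_per_epoch(FINAL_STEP, FINAL_EPOCH)
# Rough size of one epoch, assuming every optimizer step consumes a full total batch.
estimated_examples = estimated_steps * TOTAL_TRAIN_BATCH_SIZE

print(per_device)                 # 8
print(round(estimated_steps))     # about 36000, limited by the rounding of the epoch column
print(round(estimated_examples))  # roughly 576 thousand examples per epoch
```

Because the epoch column is rounded, the step and example counts are estimates rather than exact dataset sizes.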
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:126657589c9bed87d7d908daa53833a4e1cdc2e808eaee0c62189876a1169c78
 size 241984552
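Note on the LFS pointer: `model.safetensors` is stored as a Git LFS pointer, so the diff shows only the SHA-256 `oid` and the byte `size`, not the weights themselves. A minimal sketch of checking a downloaded file against the new pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are illustrative and not part of any library, and the local file path in the usage note is an assumption.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and SHA-256 digest both match the pointer."""
    algorithm, _, expected = pointer["oid"].partition(":")
    if algorithm != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algorithm}")
    if path.stat().st_size != int(pointer["size"]):
        return False  # cheap check first; avoids hashing a file of the wrong size
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in chunks so a large weights file is never held in memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected


# Pointer contents copied from the new side of the diff.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:126657589c9bed87d7d908daa53833a4e1cdc2e808eaee0c62189876a1169c78
size 241984552
"""
pointer = parse_lfs_pointer(POINTER_TEXT)
```

With the weights downloaded to a local path (assumed here to be `model.safetensors` in the working directory), `matches_pointer(Path("model.safetensors"), pointer)` should return `True` only for the file this commit refers to.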