End of training
Browse files- README.md +13 -9
- model.safetensors +1 -1
- runs/Sep17_19-06-04_ip-10-192-12-112/events.out.tfevents.1726599970.ip-10-192-12-112.1319.9 +3 -0
- runs/Sep17_19-10-17_ip-10-192-12-112/events.out.tfevents.1726600222.ip-10-192-12-112.1319.10 +3 -0
- runs/Sep17_19-12-26_ip-10-192-12-112/events.out.tfevents.1726600351.ip-10-192-12-112.1319.11 +3 -0
- runs/Sep17_19-21-44_ip-10-192-12-112/events.out.tfevents.1726600908.ip-10-192-12-112.1319.12 +3 -0
- runs/Sep17_19-24-59_ip-10-192-12-112/events.out.tfevents.1726601104.ip-10-192-12-112.1319.13 +3 -0
- runs/Sep17_19-26-18_ip-10-192-12-112/events.out.tfevents.1726601183.ip-10-192-12-112.1319.14 +3 -0
- runs/Sep17_19-29-03_ip-10-192-12-112/events.out.tfevents.1726601349.ip-10-192-12-112.1319.15 +3 -0
- runs/Sep17_19-29-30_ip-10-192-12-112/events.out.tfevents.1726601373.ip-10-192-12-112.1319.16 +3 -0
- runs/Sep17_19-31-19_ip-10-192-12-112/events.out.tfevents.1726601486.ip-10-192-12-112.1319.17 +3 -0
- tokenizer.json +1 -6
- training_args.bin +1 -1
README.md
CHANGED
@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
|
19 |
This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
-
- Loss: 4.
|
22 |
-
- Bleu: 0.
|
23 |
-
- Gen Len: 16.
|
24 |
|
25 |
## Model description
|
26 |
|
@@ -40,19 +40,23 @@ More information needed
|
|
40 |
|
41 |
The following hyperparameters were used during training:
|
42 |
- learning_rate: 2e-05
|
43 |
-
- train_batch_size:
|
44 |
-
- eval_batch_size:
|
45 |
- seed: 42
|
46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
47 |
- lr_scheduler_type: linear
|
48 |
-
- num_epochs:
|
49 |
- mixed_precision_training: Native AMP
|
50 |
|
51 |
### Training results
|
52 |
|
53 |
-
| Training Loss | Epoch | Step | Validation Loss | Bleu
|
54 |
-
|
55 |
-
| 4.
|
|
|
|
|
|
|
|
|
56 |
|
57 |
|
58 |
### Framework versions
|
|
|
18 |
|
19 |
This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
+
- Loss: 4.1552
|
22 |
+
- Bleu: 0.0813
|
23 |
+
- Gen Len: 16.4792
|
24 |
|
25 |
## Model description
|
26 |
|
|
|
40 |
|
41 |
The following hyperparameters were used during training:
|
42 |
- learning_rate: 2e-05
|
43 |
+
- train_batch_size: 4
|
44 |
+
- eval_batch_size: 4
|
45 |
- seed: 42
|
46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
47 |
- lr_scheduler_type: linear
|
48 |
+
- num_epochs: 5
|
49 |
- mixed_precision_training: Native AMP
|
50 |
|
51 |
### Training results
|
52 |
|
53 |
+
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|
54 |
+
|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
|
55 |
+
| 4.9485 | 1.0 | 144 | 4.4460 | 0.0 | 16.875 |
|
56 |
+
| 4.515 | 2.0 | 288 | 4.2735 | 0.0 | 16.625 |
|
57 |
+
| 4.3579 | 3.0 | 432 | 4.1977 | 0.0 | 16.7014 |
|
58 |
+
| 4.3095 | 4.0 | 576 | 4.1644 | 0.0818 | 16.5417 |
|
59 |
+
| 4.2744 | 5.0 | 720 | 4.1552 | 0.0813 | 16.4792 |
|
60 |
|
61 |
|
62 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 242041896
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ef0a51adef99bb0c335a98548f2911f9d3bd1b2bb9d0b40953248c994da60fc1
|
3 |
size 242041896
|
runs/Sep17_19-06-04_ip-10-192-12-112/events.out.tfevents.1726599970.ip-10-192-12-112.1319.9
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3a4082b6c65546b8ae48cfdf920a351e6947871dadf38a0d9d0b3f2c9fdf2d7b
|
3 |
+
size 6883
|
runs/Sep17_19-10-17_ip-10-192-12-112/events.out.tfevents.1726600222.ip-10-192-12-112.1319.10
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:969eaf27eafc6749abea2746ec355b2a9e677601e08aab7736f7a42fcb831bb8
|
3 |
+
size 6159
|
runs/Sep17_19-12-26_ip-10-192-12-112/events.out.tfevents.1726600351.ip-10-192-12-112.1319.11
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ce01b0863e76b937d6e5c5bbe7207332120d68149ae17f1c2b8c5cccd6e0d7ce
|
3 |
+
size 6160
|
runs/Sep17_19-21-44_ip-10-192-12-112/events.out.tfevents.1726600908.ip-10-192-12-112.1319.12
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3dc293ceccc054f6e29f1b60143509718e12c4ad9e2fe7fdcc2bf5c142dfe788
|
3 |
+
size 6161
|
runs/Sep17_19-24-59_ip-10-192-12-112/events.out.tfevents.1726601104.ip-10-192-12-112.1319.13
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:df1d8d2f00aa162654f8675c7839ef1269b74a7e6e08096b14df3a165cdc3240
|
3 |
+
size 6161
|
runs/Sep17_19-26-18_ip-10-192-12-112/events.out.tfevents.1726601183.ip-10-192-12-112.1319.14
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:82fdbdad9fd512f7c6be8c9ab63ed4dff6862ec73a51b64726d05c6447b20512
|
3 |
+
size 7465
|
runs/Sep17_19-29-03_ip-10-192-12-112/events.out.tfevents.1726601349.ip-10-192-12-112.1319.15
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a4bdd2d6245130e7014688617a43d655ac25ec0720fb1ad9f40593a8bc98f0a6
|
3 |
+
size 6161
|
runs/Sep17_19-29-30_ip-10-192-12-112/events.out.tfevents.1726601373.ip-10-192-12-112.1319.16
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:80f2ba2efca47533f703af195cfe726437eac47c040910f7fcc277c4727f8b7d
|
3 |
+
size 7465
|
runs/Sep17_19-31-19_ip-10-192-12-112/events.out.tfevents.1726601486.ip-10-192-12-112.1319.17
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eb99c20945da1555d4c8cf92ce006968a74fcdc2a596e1296156e9fff9aff13c
|
3 |
+
size 9208
|
tokenizer.json
CHANGED
@@ -1,11 +1,6 @@
|
|
1 |
{
|
2 |
"version": "1.0",
|
3 |
-
"truncation":
|
4 |
-
"direction": "Right",
|
5 |
-
"max_length": 128,
|
6 |
-
"strategy": "LongestFirst",
|
7 |
-
"stride": 0
|
8 |
-
},
|
9 |
"padding": null,
|
10 |
"added_tokens": [
|
11 |
{
|
|
|
1 |
{
|
2 |
"version": "1.0",
|
3 |
+
"truncation": null,
|
|
|
|
|
|
|
|
|
|
|
4 |
"padding": null,
|
5 |
"added_tokens": [
|
6 |
{
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5368
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d0182e1228d08bf24a4de1d1df8964f45fa2a8a5c19b929a6b7c93ef082f1ebc
|
3 |
size 5368
|