End of training

Files changed (5)

README.md CHANGED

@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [rujengelal/nepali_t5](https://huggingface.co/rujengelal/nepali_t5) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.9595
-- Bleu: 5.529
-- Gen Len: 15.9474
+- Loss: 2.6633
+- Bleu: 6.3134
+- Gen Len: 15.9835
 ## Model description
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step   | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
-| 3.5866        | 1.0   | 17734  | 3.3249          | 4.351  | 16.0208 |
-| 3.4502        | 2.0   | 35468  | 3.2435          | 4.6878 | 16.0742 |
-| 3.4103        | 3.0   | 53202  | 3.1736          | 4.9997 | 16.024  |
-| 3.306         | 4.0   | 70936  | 3.1176          | 5.0627 | 16.1368 |
-| 3.1849        | 5.0   | 88670  | 3.0681          | 5.2125 | 16.0203 |
-| 3.1681        | 6.0   | 106404 | 3.0311          | 5.2869 | 15.7844 |
-| 3.1283        | 7.0   | 124138 | 3.0028          | 5.3816 | 16.1057 |
-| 3.0484        | 8.0   | 141872 | 2.9804          | 5.3871 | 15.9089 |
-| 3.0153        | 9.0   | 159606 | 2.9638          | 5.5117 | 15.8761 |
-| 3.0429        | 10.0  | 177340 | 2.9595          | 5.529  | 15.9474 |
+| 3.0928        | 1.0   | 17734  | 2.8330          | 5.4935 | 15.9053 |
+| 3.101         | 2.0   | 35468  | 2.8127          | 5.5409 | 15.8787 |
+| 3.0165        | 3.0   | 53202  | 2.7814          | 5.6622 | 15.9238 |
+| 2.9973        | 4.0   | 70936  | 2.7532          | 5.8108 | 15.8996 |
+| 2.8885        | 5.0   | 88670  | 2.7294          | 5.9077 | 15.8805 |
+| 2.8114        | 6.0   | 106404 | 2.7074          | 6.1401 | 15.9749 |
+| 2.7791        | 7.0   | 124138 | 2.6905          | 6.1567 | 15.9531 |
+| 2.7729        | 8.0   | 141872 | 2.6782          | 6.1865 | 15.9688 |
+| 2.7128        | 9.0   | 159606 | 2.6699          | 6.2233 | 16.063  |
+| 2.7398        | 10.0  | 177340 | 2.6633          | 6.3134 | 15.9835 |
 ### Framework versions
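
The README hunks replace one set of final-epoch evaluation numbers with another. As a rough aid for reading the change, the sketch below computes the per-metric difference between the two versions. The values are copied from the diff; the dictionary names and the `deltas` helper are illustrative and not part of the repository.

```python
# Final-epoch evaluation metrics copied from the removed (-) and added (+)
# README lines in this commit. The variable names are illustrative only.
previous = {"loss": 2.9595, "bleu": 5.529, "gen_len": 15.9474}
current = {"loss": 2.6633, "bleu": 6.3134, "gen_len": 15.9835}


def deltas(old, new):
    """Return the change (new - old) for each metric, rounded to 4 decimals."""
    return {name: round(new[name] - old[name], 4) for name in old}


changes = deltas(previous, current)
for name, change in changes.items():
    print(f"{name}: {change:+.4f}")
```

A negative loss delta and a positive BLEU delta both point in the direction of improvement; the generated length is nearly unchanged.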

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:46975bcd2e7480b26f221ba860538bc8dbe64cbb666e22db081bed1b6aaad404
+oid sha256:af6ada7568eb6e75f298a8b60f42ce7052b3948ff38bd952975c9194922a19f0
 size 191081512

runs/Apr28_13-43-18_fb32f442c803/events.out.tfevents.1714311799.fb32f442c803.25.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:8c2c60a8acd44cbfb9a7290bce9d606a6a85f22714ba2d2418e26daab8026496
+size 85706
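
The binary files in this commit appear only as Git LFS pointer files: short text stubs with `version`, `oid`, and `size` lines. A minimal sketch of reading such a pointer is shown below, using the pointer text for the added events file (without the leading `+` diff marker). The `parse_lfs_pointer` helper is hypothetical and handles only the three keys seen in this diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a small dict.

    Each pointer line is "<key> <value>"; the oid value is
    "<hash algorithm>:<hex digest>".
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer contents for the added TensorBoard events file, copied from the diff.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8c2c60a8acd44cbfb9a7290bce9d606a6a85f22714ba2d2418e26daab8026496
size 85706
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["size"])
```

Read this way, the `model.safetensors` and `training_args.bin` diffs show a changed `oid` with an unchanged `size`, which means the contents changed while the byte count stayed the same.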

tokenizer_config.json CHANGED

@@ -930,8 +930,12 @@
   "clean_up_tokenization_spaces": true,
   "eos_token": "</s>",
   "extra_ids": 100,
+  "max_length": 512,
   "model_max_length": 512,
   "pad_token": "<pad>",
+  "stride": 0,
   "tokenizer_class": "T5Tokenizer",
+  "truncation_side": "right",
+  "truncation_strategy": "longest_first",
   "unk_token": "<unk>"
 }
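
The four added keys describe truncation behavior: a 512-token limit, truncation from the right, and no overlap (`stride` of 0). The sketch below is a simplified, library-free illustration of what those settings mean for a single sequence. It is not the `transformers` implementation, and the `truncate` helper and its parameters are assumptions made for illustration.

```python
def truncate(token_ids, max_length=512, truncation_side="right"):
    """Illustrate single-sequence truncation to at most max_length tokens.

    "right" drops tokens from the end and keeps the beginning;
    "left" drops tokens from the start and keeps the end.
    Overflow windows are not modeled, which corresponds to ignoring stride.
    """
    if len(token_ids) <= max_length:
        return list(token_ids)
    if truncation_side == "right":
        return list(token_ids[:max_length])
    if truncation_side == "left":
        return list(token_ids[-max_length:])
    raise ValueError(f"unknown truncation_side: {truncation_side!r}")


# A stand-in for a 600-token input; real token ids would come from the tokenizer.
ids = list(range(600))
kept = truncate(ids)
print(len(kept), kept[0], kept[-1])
```

The `longest_first` strategy only makes a difference when a pair of sequences is encoded together, so it is not modeled here.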

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bb2562c62a13d7b89c2992c3a3efcdddf69de2aad771394b18d65a9f2865a0a2
+oid sha256:b5c04d5e14d29aacbd0099666019608d5183dc1ad99c8edf6599a5de4266db64
 size 5048