End of training

Files changed (6) hide show

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 license: apache-2.0
-base_model: t5-small
 tags:
 - generated_from_trainer
 metrics:
@@ -15,11 +15,11 @@ should probably proofread and complete it, then remove this comment. -->
 # nepali_t5
-This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.4274
-- Bleu: 4.4727
-- Gen Len: 16.0017
 ## Model description
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step   | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
-| 5.0           | 1.0   | 17734  | 4.7335          | 2.2286 | 15.5907 |
-| 4.4395        | 2.0   | 35468  | 4.2401          | 2.9281 | 15.7406 |
-| 4.1509        | 3.0   | 53202  | 3.9709          | 3.206  | 16.1203 |
-| 3.9609        | 4.0   | 70936  | 3.7968          | 3.6191 | 15.8338 |
-| 3.8746        | 5.0   | 88670  | 3.6712          | 3.8795 | 16.0679 |
-| 3.7316        | 6.0   | 106404 | 3.5811          | 3.9517 | 15.9977 |
-| 3.7038        | 7.0   | 124138 | 3.5185          | 4.2873 | 16.0255 |
-| 3.5782        | 8.0   | 141872 | 3.4695          | 4.3817 | 16.0927 |
-| 3.5957        | 9.0   | 159606 | 3.4387          | 4.4197 | 16.0783 |
-| 3.564         | 10.0  | 177340 | 3.4274          | 4.4727 | 16.0017 |
 ### Framework versions

 ---
 license: apache-2.0
+base_model: rujengelal/nepali_t5
 tags:
 - generated_from_trainer
 metrics:
 # nepali_t5
+This model is a fine-tuned version of [rujengelal/nepali_t5](https://huggingface.co/rujengelal/nepali_t5) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.9595
+- Bleu: 5.529
+- Gen Len: 15.9474
 ## Model description
 | Training Loss | Epoch | Step   | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
+| 3.5866        | 1.0   | 17734  | 3.3249          | 4.351  | 16.0208 |
+| 3.4502        | 2.0   | 35468  | 3.2435          | 4.6878 | 16.0742 |
+| 3.4103        | 3.0   | 53202  | 3.1736          | 4.9997 | 16.024  |
+| 3.306         | 4.0   | 70936  | 3.1176          | 5.0627 | 16.1368 |
+| 3.1849        | 5.0   | 88670  | 3.0681          | 5.2125 | 16.0203 |
+| 3.1681        | 6.0   | 106404 | 3.0311          | 5.2869 | 15.7844 |
+| 3.1283        | 7.0   | 124138 | 3.0028          | 5.3816 | 16.1057 |
+| 3.0484        | 8.0   | 141872 | 2.9804          | 5.3871 | 15.9089 |
+| 3.0153        | 9.0   | 159606 | 2.9638          | 5.5117 | 15.8761 |
+| 3.0429        | 10.0  | 177340 | 2.9595          | 5.529  | 15.9474 |
 ### Framework versions

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "t5-small",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

 {
+  "_name_or_path": "rujengelal/nepali_t5",
   "architectures": [
     "T5ForConditionalGeneration"
   ],

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:14d93818b254100cb7503c2e4353164b9d45ae107bdab3e7aa64b3376840ad3e
 size 191081512

 version https://git-lfs.github.com/spec/v1
+oid sha256:46975bcd2e7480b26f221ba860538bc8dbe64cbb666e22db081bed1b6aaad404
 size 191081512

runs/Apr28_02-23-46_f05d94549f92/events.out.tfevents.1714271027.f05d94549f92.24.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:d1c3d5db294217a20f8f1364e2f13feaa66985323ab519b635d19323a7f658d1
+size 85706

tokenizer.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:75f7ea09b70d0e93b7fe6de0bab2000d62989352d1636f4a12be1cb2f817986f
 size 5048

 version https://git-lfs.github.com/spec/v1
+oid sha256:bb2562c62a13d7b89c2992c3a3efcdddf69de2aad771394b18d65a9f2865a0a2
 size 5048