Karzan
/

walamakan-t5-base

@@ -2,6 +2,8 @@
 base_model: Karzan/walamakan-t5-base
 tags:
 - generated_from_trainer
 model-index:
 - name: walamakan-t5-base
   results: []
@@ -13,6 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # walamakan-t5-base
 This model is a fine-tuned version of [Karzan/walamakan-t5-base](https://huggingface.co/Karzan/walamakan-t5-base) on an unknown dataset.
 ## Model description
@@ -39,13 +45,32 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:----:|:-------:|
-| No log        | 0.96  | 21   | 1.3691          | 0.0  | 19.0    |
 ### Framework versions

 base_model: Karzan/walamakan-t5-base
 tags:
 - generated_from_trainer
+metrics:
+- bleu
 model-index:
 - name: walamakan-t5-base
   results: []
 # walamakan-t5-base
 This model is a fine-tuned version of [Karzan/walamakan-t5-base](https://huggingface.co/Karzan/walamakan-t5-base) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.3291
+- Bleu: 0.0
+- Gen Len: 19.0
 ## Model description
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:----:|:-------:|
+| No log        | 0.96  | 21   | 1.3668          | 0.0  | 19.0    |
+| No log        | 1.97  | 43   | 1.3613          | 0.0  | 19.0    |
+| No log        | 2.98  | 65   | 1.3530          | 0.0  | 19.0    |
+| No log        | 3.99  | 87   | 1.3503          | 0.0  | 19.0    |
+| No log        | 5.0   | 109  | 1.3476          | 0.0  | 19.0    |
+| No log        | 5.96  | 130  | 1.3429          | 0.0  | 19.0    |
+| No log        | 6.97  | 152  | 1.3415          | 0.0  | 19.0    |
+| No log        | 7.98  | 174  | 1.3408          | 0.0  | 19.0    |
+| No log        | 8.99  | 196  | 1.3382          | 0.0  | 19.0    |
+| No log        | 9.99  | 218  | 1.3387          | 0.0  | 19.0    |
+| No log        | 10.96 | 239  | 1.3301          | 0.0  | 19.0    |
+| No log        | 11.97 | 261  | 1.3344          | 0.0  | 19.0    |
+| No log        | 12.97 | 283  | 1.3323          | 0.0  | 19.0    |
+| No log        | 13.98 | 305  | 1.3312          | 0.0  | 19.0    |
+| No log        | 14.99 | 327  | 1.3329          | 0.0  | 19.0    |
+| No log        | 16.0  | 349  | 1.3323          | 0.0  | 19.0    |
+| No log        | 16.96 | 370  | 1.3289          | 0.0  | 19.0    |
+| No log        | 17.97 | 392  | 1.3291          | 0.0  | 19.0    |
+| No log        | 18.98 | 414  | 1.3289          | 0.0  | 19.0    |
+| No log        | 19.26 | 420  | 1.3291          | 0.0  | 19.0    |
 ### Framework versions

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c3e134b4a856674b2ca06ce11c3cd228a1458bdedaf9bcaad9262f3dacc7d8f1
 size 990236853

 version https://git-lfs.github.com/spec/v1
+oid sha256:a59f94bc011750159f9cacbf5d8460a128e92b472cc90e70399327aa8cccab0a
 size 990236853

tokenizer.json CHANGED Viewed

@@ -1,11 +1,6 @@
 {
   "version": "1.0",
-  "truncation": {
-    "direction": "Right",
-    "max_length": 1024,
-    "strategy": "LongestFirst",
-    "stride": 0
-  },
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2da484213889f405fb841ef3ea597caa56d45c6ea603341ac9cf26117859e10b
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:fddf7503fe020ee260f8b533639282e2179749eadc701001ba0f6fef7d9bff43
 size 4155