Vichentito
/

Nahuatl_Espanol_v1

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Vichentito commited on Jun 11

Commit

b902f18

•

1 Parent(s): 975d2c3

End of training

Files changed (2) hide show

README.md +8 -18
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6835
-- Bleu: 25.7459
-- Gen Len: 45.7999
 ## Model description
@@ -42,30 +42,20 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:------:|:----:|:---------------:|:-------:|:-------:|
-| No log        | 0.3027 | 300  | 0.6651          | 24.827  | 46.2006 |
-| 0.9651        | 0.6054 | 600  | 0.6967          | 24.0607 | 45.3494 |
-| 0.9651        | 0.9082 | 900  | 0.7045          | 23.7928 | 46.1327 |
-| 1.001         | 1.2109 | 1200 | 0.7084          | 23.9299 | 46.4082 |
-| 0.8741        | 1.5136 | 1500 | 0.7156          | 23.9047 | 45.8685 |
-| 0.8741        | 1.8163 | 1800 | 0.7121          | 23.9386 | 45.7796 |
-| 0.8763        | 2.1191 | 2100 | 0.7083          | 24.5377 | 45.8846 |
-| 0.8763        | 2.4218 | 2400 | 0.7032          | 24.6723 | 46.1827 |
-| 0.7689        | 2.7245 | 2700 | 0.6988          | 24.7631 | 45.8793 |
-| 0.7599        | 3.0272 | 3000 | 0.6961          | 25.2701 | 45.7947 |
-| 0.7599        | 3.3300 | 3300 | 0.6935          | 25.4704 | 45.7461 |
-| 0.6782        | 3.6327 | 3600 | 0.6861          | 25.7835 | 45.9797 |
-| 0.6782        | 3.9354 | 3900 | 0.6835          | 25.7459 | 45.7999 |
 ### Framework versions
 - Transformers 4.41.2
-- Pytorch 2.3.0+cu121
 - Datasets 2.19.2
 - Tokenizers 0.19.1

 This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6778
+- Bleu: 29.2544
+- Gen Len: 42.705
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 8
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:------:|:----:|:---------------:|:-------:|:-------:|
+| No log        | 2.1277 | 300  | 0.7834          | 26.6199 | 41.9314 |
+| 0.3038        | 4.2553 | 600  | 0.6428          | 27.5908 | 42.4358 |
+| 0.3038        | 6.3830 | 900  | 0.6778          | 29.2544 | 42.705  |
 ### Framework versions
 - Transformers 4.41.2
+- Pytorch 2.1.0
 - Datasets 2.19.2
 - Tokenizers 0.19.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a82ac66d4636a2339ab0beab4c1963c52642b945139f2136311e0a5d58519137
 size 990345064

 version https://git-lfs.github.com/spec/v1
+oid sha256:f80fd3005a7c42553118395fc244681f20c26391d35d2965dfa126fef8b9e008
 size 990345064