sksy/mt5-small-finetuned-amazon-fr-es

@@ -1,72 +1,72 @@
----
-library_name: transformers
-license: apache-2.0
-base_model: google/mt5-small
-tags:
-- summarization
-- generated_from_trainer
-metrics:
-- rouge
-model-index:
-- name: mt5-small-finetuned-amazon-en-es
-  results: []
----
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# mt5-small-finetuned-amazon-en-es
-This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
-It achieves the following results on the evaluation set:
-- Loss: 2.9815
-- Rouge1: 15.1268
-- Rouge2: 6.2834
-- Rougel: 14.1836
-- Rougelsum: 14.2891
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
-The following hyperparameters were used during training:
-- learning_rate: 5.6e-05
-- train_batch_size: 8
-- eval_batch_size: 8
-- seed: 42
-- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
-- lr_scheduler_type: linear
-- num_epochs: 8
-### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum |
-|:-------------:|:-----:|:-----:|:---------------:|:-------:|:------:|:-------:|:---------:|
-| 7.0578        | 1.0   | 1374  | 3.2895          | 11.9843 | 3.2125 | 11.6212 | 11.7535   |
-| 3.8304        | 2.0   | 2748  | 3.1339          | 15.0305 | 4.9763 | 14.6223 | 14.6737   |
-| 3.5073        | 3.0   | 4122  | 3.0671          | 13.8606 | 5.3467 | 13.1739 | 13.1314   |
-| 3.3503        | 4.0   | 5496  | 3.0203          | 15.0263 | 6.0731 | 14.3983 | 14.4509   |
-| 3.2406        | 5.0   | 6870  | 3.0035          | 15.0129 | 6.1964 | 14.2638 | 14.3531   |
-| 3.1817        | 6.0   | 8244  | 2.9983          | 15.3756 | 6.3463 | 14.462  | 14.5718   |
-| 3.1265        | 7.0   | 9618  | 2.9899          | 15.0742 | 6.3106 | 14.0789 | 14.224    |
-| 3.1088        | 8.0   | 10992 | 2.9815          | 15.1268 | 6.2834 | 14.1836 | 14.2891   |
-### Framework versions
-- Transformers 4.46.2
-- Pytorch 2.5.0+cu118
-- Datasets 3.1.0
-- Tokenizers 0.20.1

+---
+library_name: transformers
+license: apache-2.0
+base_model: google/mt5-small
+tags:
+- summarization
+- generated_from_trainer
+metrics:
+- rouge
+model-index:
+- name: mt5-small-finetuned-amazon-fr-es
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# mt5-small-finetuned-amazon-fr-es
+This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.9815
+- Rouge1: 15.1268
+- Rouge2: 6.2834
+- Rougel: 14.1836
+- Rougelsum: 14.2891
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5.6e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: linear
+- num_epochs: 8
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:------:|:-------:|:---------:|
+| 7.0578        | 1.0   | 1374  | 3.2895          | 11.9843 | 3.2125 | 11.6212 | 11.7535   |
+| 3.8304        | 2.0   | 2748  | 3.1339          | 15.0305 | 4.9763 | 14.6223 | 14.6737   |
+| 3.5073        | 3.0   | 4122  | 3.0671          | 13.8606 | 5.3467 | 13.1739 | 13.1314   |
+| 3.3503        | 4.0   | 5496  | 3.0203          | 15.0263 | 6.0731 | 14.3983 | 14.4509   |
+| 3.2406        | 5.0   | 6870  | 3.0035          | 15.0129 | 6.1964 | 14.2638 | 14.3531   |
+| 3.1817        | 6.0   | 8244  | 2.9983          | 15.3756 | 6.3463 | 14.462  | 14.5718   |
+| 3.1265        | 7.0   | 9618  | 2.9899          | 15.0742 | 6.3106 | 14.0789 | 14.224    |
+| 3.1088        | 8.0   | 10992 | 2.9815          | 15.1268 | 6.2834 | 14.1836 | 14.2891   |
+### Framework versions
+- Transformers 4.46.2
+- Pytorch 2.5.0+cu118
+- Datasets 3.1.0
+- Tokenizers 0.20.1
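
Note on the hyperparameters in the new card: `lr_scheduler_type: linear` with `learning_rate: 5.6e-05` and `num_epochs: 8` implies a learning rate that decays linearly to zero over the run. The sketch below reproduces that schedule in plain Python using the step counts from the training results table (1374 steps per epoch, 10992 in total). It is an illustration, not the Trainer's own code: the card does not list warmup steps, so `WARMUP_STEPS = 0` is an assumption, and `linear_lr` is a hypothetical helper name.

```python
# Values copied from the model card's hyperparameter list and results table.
LEARNING_RATE = 5.6e-05
NUM_EPOCHS = 8
STEPS_PER_EPOCH = 1374  # step count reported at epoch 1.0

# ASSUMPTION: the card does not list warmup, so none is modelled here.
WARMUP_STEPS = 0

TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def linear_lr(step: int) -> float:
    """Learning rate at an optimizer step: linear warmup, then linear decay to zero."""
    if step < WARMUP_STEPS:
        return LEARNING_RATE * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return LEARNING_RATE * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


if __name__ == "__main__":
    # Print the learning rate at each epoch boundary in the results table.
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch}: step {step:>5}, lr {linear_lr(step):.3e}")
```

Under these assumptions the rate is half its initial value at step 5496 (epoch 4.0) and reaches zero at step 10992 (epoch 8.0), which is consistent with the shrinking per-epoch improvements in validation loss toward the end of the table.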