ajscalers/t5-small-finetuned-xsum

ajscalers committed on Apr 27, 2023

Commit 9df0298 • 1 Parent(s): b074060

update model card README.md

Files changed (1)

README.md +6 -28

@@ -4,24 +4,9 @@ tags:
 - generated_from_trainer
 datasets:
 - xsum
-metrics:
-- rouge
 model-index:
 - name: t5-small-finetuned-xsum
-  results:
-  - task:
-      name: Sequence-to-sequence Language Modeling
-      type: text2text-generation
-    dataset:
-      name: xsum
-      type: xsum
-      config: default
-      split: validation
-      args: default
-    metrics:
-    - name: Rouge1
-      type: rouge
-      value: 26.7156
+  results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,13 +15,6 @@ should probably proofread and complete it, then remove this comment. -->
 # t5-small-finetuned-xsum
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the xsum dataset.
-It achieves the following results on the evaluation set:
-- Loss: 2.5616
-- Rouge1: 26.7156
-- Rouge2: 6.6843
-- Rougel: 20.819
-- Rougelsum: 20.8237
-- Gen Len: 18.8024
 ## Model description
@@ -56,8 +34,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 128
-- eval_batch_size: 128
+- train_batch_size: 672
+- eval_batch_size: 672
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -66,9 +44,9 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel | Rougelsum | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:------:|:---------:|:-------:|
-| 2.8067        | 1.0   | 1595 | 2.5616          | 26.7156 | 6.6843 | 20.819 | 20.8237   | 18.8024 |
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
+| No log        | 1.0   | 304  | 2.6701          | 23.8174 | 5.0625 | 18.3813 | 18.3799   | 18.7324 |
 ### Framework versions
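
Note on the Step column: the drop from 1595 to 304 steps for one epoch appears consistent with the batch size change from 128 to 672. The sketch below checks that arithmetic. It assumes the standard XSum training split of 204,045 examples and no gradient accumulation, neither of which is stated in this commit; `XSUM_TRAIN_EXAMPLES` and `steps_per_epoch` are illustrative names, not part of the model card.

```python
import math

# Assumption: size of the standard XSum training split (not stated in the commit).
XSUM_TRAIN_EXAMPLES = 204_045


def steps_per_epoch(num_examples: int, batch_size: int) -> int:
    """Optimizer steps in one epoch, counting the final partial batch."""
    return math.ceil(num_examples / batch_size)


# Batch sizes taken from the removed and added hyperparameter lines in the diff.
old_steps = steps_per_epoch(XSUM_TRAIN_EXAMPLES, 128)
new_steps = steps_per_epoch(XSUM_TRAIN_EXAMPLES, 672)

print(f"batch 128 -> {old_steps} steps, batch 672 -> {new_steps} steps")
```

Under these assumptions the computed values match the Step entries in the old and new result rows. The `No log` entry in the Training Loss column is likely because the run finished before the Trainer's first logging step (the default `logging_steps` is 500, more than the 304 steps run), though the commit does not state the logging configuration.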