GuysTrans/bart-base-re-attention-seq-512


GuysTrans committed on Nov 17, 2023

Commit f736cb3 · 1 Parent(s): cd1f940

End of training

Files changed (3)

README.md +16 -0
generation_config.json +1 -1
pytorch_model.bin +1 -1

README.md CHANGED

@@ -1,6 +1,8 @@
 ---
 tags:
 - generated_from_trainer
+metrics:
+- rouge
 model-index:
 - name: bart-base-re-attention-seq-512
   results: []
@@ -12,6 +14,13 @@ should probably proofread and complete it, then remove this comment. -->
 # bart-base-re-attention-seq-512
 This model was trained from scratch on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 1.0170
+- Rouge1: 34.1887
+- Rouge2: 25.9559
+- Rougel: 32.5277
+- Rougelsum: 33.5841
+- Gen Len: 25.9109
 ## Model description
@@ -38,6 +47,13 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 1
+### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| 2.149         | 1.0   | 18247 | 1.0170          | 34.1887 | 25.9559 | 32.5277 | 33.5841   | 25.9109 |
 ### Framework versions
 - Transformers 4.33.0
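
The training results row added to README.md can be read programmatically when comparing runs. The following is a minimal sketch using only the Python standard library; the helper names `split_cells` and `parse_row` are hypothetical and not part of the model repository or of Transformers.

```python
# Header and data row copied from the README diff; the alignment row is omitted.
HEADER = "| Training Loss | Epoch | Step  | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |"
ROW = "| 2.149         | 1.0   | 18247 | 1.0170          | 34.1887 | 25.9559 | 32.5277 | 33.5841   | 25.9109 |"


def split_cells(line):
    """Split one Markdown table line into stripped cell strings."""
    return [cell.strip() for cell in line.strip().strip("|").split("|")]


def parse_row(header, row):
    """Pair each column name with its value, converted to float."""
    return {name: float(value) for name, value in zip(split_cells(header), split_cells(row))}


results = parse_row(HEADER, ROW)
print(results["Validation Loss"], results["Rouge1"])  # 1.017 34.1887
```

The values in this single row match the evaluation summary added near the top of the README (Loss 1.0170, Rouge1 34.1887, and so on), since training ran for one epoch.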

generation_config.json CHANGED

@@ -5,7 +5,7 @@
   "eos_token_id": 2,
   "forced_bos_token_id": 0,
   "forced_eos_token_id": 2,
-  "max_length": 512,
+  "max_length": 26,
   "no_repeat_ngram_size": 3,
   "num_beams": 4,
   "pad_token_id": 1,

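The only key that changes in the generation_config.json hunk is `max_length`, which drops from 512 to 26. As a rough illustration, the sketch below compares the two versions of the visible fragment using the standard library; the fragments contain only the keys shown in the hunk (the rest of the file is not shown), and `changed_keys` is a hypothetical helper.

```python
import json

# Only the keys visible in the diff hunk; the remainder of the file is unknown.
old_fragment = json.loads("""
{
  "eos_token_id": 2,
  "forced_bos_token_id": 0,
  "forced_eos_token_id": 2,
  "max_length": 512,
  "no_repeat_ngram_size": 3,
  "num_beams": 4,
  "pad_token_id": 1
}
""")
new_fragment = {**old_fragment, "max_length": 26}


def changed_keys(old, new):
    """Return {key: (old_value, new_value)} for every key whose value differs."""
    keys = sorted(set(old) | set(new))
    return {k: (old.get(k), new.get(k)) for k in keys if old.get(k) != new.get(k)}


changes = changed_keys(old_fragment, new_fragment)
print(changes)  # {'max_length': (512, 26)}
```

The new default of 26 sits just above the reported Gen Len of 25.9109. Callers who need longer outputs would likely have to override the length limit when calling generation, since the value in this file acts only as a default.
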
pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e60a89dbd6297fc07e77fb92d74b46f685426d2f4aecac8e1ae142bcc3968267
+oid sha256:32c130c4aa9443012a8d50c17d798b6d8b1e9e5a920e8ca697dd666c1fe8a536
 size 558018637
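
The pytorch_model.bin diff shows a Git LFS pointer rather than the weights themselves: the `oid` changes while `size` stays at 558018637 bytes. The sketch below shows one way to parse such a pointer and check a downloaded file against it, using only the standard library. The function names `parse_pointer` and `matches_pointer` are hypothetical, and the file path passed to `matches_pointer` is an assumption left to the caller.

```python
import hashlib

# New pointer contents, copied from the diff.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:32c130c4aa9443012a8d50c17d798b6d8b1e9e5a920e8ca697dd666c1fe8a536
size 558018637
"""


def parse_pointer(text):
    """Parse 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest named by the pointer."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as fh:
        # Read in chunks so a large weights file never has to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])  # sha256 558018637
```

A call such as `matches_pointer("pytorch_model.bin", pointer)` would then confirm that a local copy corresponds to this commit rather than to the parent, whose pointer has the same size but a different digest.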