ganse/word_wizards_final_t5-small

Files changed (5) hide show

README.md CHANGED Viewed

@@ -3,8 +3,6 @@ license: apache-2.0
 base_model: facebook/bart-base
 tags:
 - generated_from_trainer
-metrics:
-- rouge
 model-index:
 - name: my_awesome_billsum_model
   results: []
@@ -16,13 +14,6 @@ should probably proofread and complete it, then remove this comment. -->
 # my_awesome_billsum_model
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.0001
-- Rouge1: 1.0
-- Rouge2: 1.0
-- Rougel: 1.0
-- Rougelsum: 1.0
-- Gen Len: 11.0
 ## Model description
@@ -42,20 +33,19 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 2
-- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log        | 1.0   | 20   | 0.0278          | 1.0    | 1.0    | 1.0    | 1.0       | 11.0    |
-| No log        | 2.0   | 40   | 0.0001          | 1.0    | 1.0    | 1.0    | 1.0       | 11.0    |
 ### Framework versions

 base_model: facebook/bart-base
 tags:
 - generated_from_trainer
 model-index:
 - name: my_awesome_billsum_model
   results: []
 # my_awesome_billsum_model
 This model is a fine-tuned version of [facebook/bart-base](https://huggingface.co/facebook/bart-base) on the None dataset.
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 1
+- eval_batch_size: 1
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| No log        | 1.0   | 40   | 0.0000          | 1.0    | 1.0    | 1.0    | 1.0       | 11.0    |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f3c4b4c3a0c138101c49e896417c9642e265e472f10213de10237adc80fa23b2
 size 557912620

 version https://git-lfs.github.com/spec/v1
+oid sha256:b7f6d164f9202cd39248a94dfdbcf01c58acd7a8216211ef4ed27c05179e8a5c
 size 557912620

runs/Dec07_00-08-29_26c4abc7d069/events.out.tfevents.1701907712.26c4abc7d069.27607.6 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a1a7f586a1a601f607746974cdbe66cc974a21eb8c1568a7209460b107ae7b8b
+size 6342

tokenizer.json CHANGED Viewed

@@ -2,11 +2,18 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 128,
     "strategy": "LongestFirst",
     "stride": 0
   },
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 1024,
     "strategy": "LongestFirst",
     "stride": 0
   },
+  "padding": {
+    "strategy": "BatchLongest",
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 1,
+    "pad_type_id": 0,
+    "pad_token": "<pad>"
+  },
   "added_tokens": [
     {
       "id": 0,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cb693c8a8cb15ed2c3ec7eaaf198916dab1a09e1c38ce766947043cd29d5a7f5
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:909dd7bcdc4bf4e55661c472fe6b8b41d693c9333d4312cf6a54f35cd03346b6
 size 4728