End of training

Files changed (7) hide show

README.md CHANGED Viewed

@@ -11,13 +11,11 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/jadavpur/huggingface/runs/nqfl3694)
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/jadavpur/huggingface/runs/nqfl3694)
 # flan-t5-base-cnn_dailymail
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6839
 ## Model description
@@ -42,18 +40,20 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.8567        | 1.0   | 125  | 1.6839          |
 ### Framework versions
-- Transformers 4.41.0
 - Pytorch 2.1.2
 - Datasets 2.19.1
 - Tokenizers 0.19.1

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # flan-t5-base-cnn_dailymail
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.9141
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.8436        | 1.0   | 125  | 1.8955          |
+| 2.0678        | 2.0   | 250  | 1.9134          |
+| 1.8895        | 3.0   | 375  | 1.9141          |
 ### Framework versions
+- Transformers 4.41.1
 - Pytorch 2.1.2
 - Datasets 2.19.1
 - Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -56,7 +56,7 @@
   },
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
-  "transformers_version": "4.41.0",
   "use_cache": true,
   "vocab_size": 32128
 }

   },
   "tie_word_embeddings": false,
   "torch_dtype": "float32",
+  "transformers_version": "4.41.1",
   "use_cache": true,
   "vocab_size": 32128
 }

generation_config.json CHANGED Viewed

@@ -3,5 +3,5 @@
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
-  "transformers_version": "4.41.0"
 }

   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
+  "transformers_version": "4.41.1"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:88bc212525cf546f6dc9af27d06e097ee12e4ddaa2bbed69675d13163066c493
 size 990345064

 version https://git-lfs.github.com/spec/v1
+oid sha256:7359d6c57e34c545a9f3952bf43dc60b2fc43f7ebe754fb209384812cbde78d3
 size 990345064

runs/May22_22-02-20_808d2f83ffb7/events.out.tfevents.1716415346.808d2f83ffb7.34.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:cb90bc2c774d0a6338316087177848c36e9288261405f0d8d3a17264f1bd2951
+size 22774

tokenizer.json CHANGED Viewed

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 106,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 106
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 112,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 112
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:632fb37f26c11ce54425a706654d76ccf4d18dce77690df35d3cb93d9cfe1de6
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:7779e3cf56305174cfef88db29a59aa86ed87db8acd6be8d22ab2313dd55b7f6
 size 5304