End of training

Files changed (5)

README.md CHANGED

@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0148
-- Exact Match: 16.6
-- Gen Len: 3.997
 ## Model description
@@ -42,13 +42,16 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Exact Match | Gen Len |
-|:-------------:|:-----:|:----:|:---------------:|:-----------:|:-------:|
-| No log        | 1.0   | 125  | 1.0148          | 16.6        | 3.997   |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8087
+- Exact Match: 17.1569
+- Gen Len: 4.0
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 4
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Exact Match | Gen Len |
+|:-------------:|:-----:|:-----:|:---------------:|:-----------:|:-------:|
+| 0.8486        | 1.0   | 4246  | 0.8367          | 30.085      | 3.0     |
+| 0.8316        | 2.0   | 8492  | 0.8192          | 17.1569     | 4.0     |
+| 0.8266        | 3.0   | 12738 | 0.8136          | 17.1569     | 4.0     |
+| 0.818         | 4.0   | 16984 | 0.8087          | 17.1569     | 4.0     |
 ### Framework versions
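
The updated results table can be sanity-checked with a short script. This is a minimal sketch, not part of the commit: the rows are copied by hand from the table above, and the helper names `steps_per_epoch` and `best_row` are hypothetical. It checks that every epoch covers the same number of optimizer steps and reports which epoch scores best on each metric.

```python
# Rows copied by hand from the "Training results" table in the updated README.
# Columns: training loss, epoch, step, validation loss, exact match, gen len.
ROWS = [
    (0.8486, 1.0, 4246, 0.8367, 30.085, 3.0),
    (0.8316, 2.0, 8492, 0.8192, 17.1569, 4.0),
    (0.8266, 3.0, 12738, 0.8136, 17.1569, 4.0),
    (0.818, 4.0, 16984, 0.8087, 17.1569, 4.0),
]


def steps_per_epoch(rows):
    """Return the set of step/epoch ratios; a single value means equal-sized epochs."""
    return {round(step / epoch) for _, epoch, step, *_ in rows}


def best_row(rows, column, highest=True):
    """Return the row with the highest (or lowest) value in the given column index."""
    pick = max if highest else min
    return pick(rows, key=lambda row: row[column])


print("steps per epoch:", steps_per_epoch(ROWS))
print("epoch with lowest validation loss:", best_row(ROWS, 3, highest=False)[1])
print("epoch with highest exact match:", best_row(ROWS, 4)[1])
```

In this table the lowest validation loss falls on epoch 4, which is the row the README summary reports, while the highest Exact Match falls on epoch 1, where Gen Len is 3.0 rather than 4.0.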

logs/events.out.tfevents.1716406930.Chris_PC.15220.5 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:3ecd06ff79fd58172c8fd5ed020aa2388894e4434ab503d10c3af55586ae74f9
+size 14677
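
The three added lines are a Git LFS pointer, not the event log itself: the repository stores only the spec version, the SHA-256 object ID, and the byte size. A minimal sketch of reading such a pointer follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, and the pointer text is copied from the lines above with the leading `+` diff markers removed.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3ecd06ff79fd58172c8fd5ed020aa2388894e4434ab503d10c3af55586ae74f9
size 14677
"""

info = parse_lfs_pointer(POINTER)
print(info["hash_algo"], info["size"])
```

The same helper applies to the `model.safetensors` and `training_args.bin` pointers in this commit, where only the `oid` changes and the `size` stays the same.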

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a5b1746e3c3a6d01324ea95ba56df33d9d0df54985fbdd65abf371961911e248
 size 990345064

 version https://git-lfs.github.com/spec/v1
+oid sha256:d3870a4602bddd8c32b606532b54ac3ff0719c22a29b57fb1e16b9a834c97d64
 size 990345064

tokenizer.json CHANGED

@@ -2,13 +2,13 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 9,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
-      "Fixed": 9
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 10,
     "strategy": "LongestFirst",
     "stride": 0
   },
   "padding": {
     "strategy": {
+      "Fixed": 10
     },
     "direction": "Right",
     "pad_to_multiple_of": null,

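The `tokenizer.json` hunk changes two values together: the truncation `max_length` and the `Fixed` padding length both move from 9 to 10. The sketch below reproduces that change on a dict fragment limited to the fields visible in the diff. It uses only the standard library, and the helpers `with_fixed_length` and `changed_paths` are hypothetical; how the commit actually produced the new file is not shown in the diff.

```python
import copy

# Fragment of tokenizer.json limited to the fields visible in the diff (old side).
OLD = {
    "version": "1.0",
    "truncation": {
        "direction": "Right",
        "max_length": 9,
        "strategy": "LongestFirst",
        "stride": 0,
    },
    "padding": {
        "strategy": {"Fixed": 9},
        "direction": "Right",
        "pad_to_multiple_of": None,
    },
}


def with_fixed_length(config, length):
    """Return a copy whose truncation limit and fixed padding length both equal `length`."""
    updated = copy.deepcopy(config)
    updated["truncation"]["max_length"] = length
    updated["padding"]["strategy"] = {"Fixed": length}
    return updated


def changed_paths(old, new, prefix=""):
    """List dotted paths whose values differ between two nested dicts."""
    paths = []
    for key in old:
        path = f"{prefix}{key}"
        if isinstance(old[key], dict) and isinstance(new.get(key), dict):
            paths += changed_paths(old[key], new[key], path + ".")
        elif old[key] != new.get(key):
            paths.append(path)
    return paths


NEW = with_fixed_length(OLD, 10)
print(changed_paths(OLD, NEW))
```

Keeping the two values equal means every encoded sequence is truncated or padded to exactly the same length, which matches the paired change in the hunk.
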
training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c645f254ab314b93c8da4a9c8d345161a133ab2a4dd79f695e68be3c600ba976
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:39a43dd6dbc5117c777bb057f2713e2fdd038819a6dfa4f153acecf3f8bc4cca
 size 5304