End of training

Browse files

Files changed (4) hide show

README.md +12 -22
model.safetensors +1 -1
runs/Jun03_10-19-58_iit-p/events.out.tfevents.1717390202.iit-p.5413.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2471
 ## Model description
@@ -40,32 +40,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 159  | 0.1309          |
-| No log        | 2.0   | 318  | 0.1204          |
-| No log        | 3.0   | 477  | 0.1239          |
-| 0.171         | 4.0   | 636  | 0.1199          |
-| 0.171         | 5.0   | 795  | 0.1251          |
-| 0.171         | 6.0   | 954  | 0.1378          |
-| 0.0759        | 7.0   | 1113 | 0.1458          |
-| 0.0759        | 8.0   | 1272 | 0.1524          |
-| 0.0759        | 9.0   | 1431 | 0.1609          |
-| 0.0477        | 10.0  | 1590 | 0.1746          |
-| 0.0477        | 11.0  | 1749 | 0.1811          |
-| 0.0477        | 12.0  | 1908 | 0.1888          |
-| 0.0296        | 13.0  | 2067 | 0.1953          |
-| 0.0296        | 14.0  | 2226 | 0.2104          |
-| 0.0296        | 15.0  | 2385 | 0.2095          |
-| 0.0194        | 16.0  | 2544 | 0.2175          |
-| 0.0194        | 17.0  | 2703 | 0.2319          |
-| 0.0194        | 18.0  | 2862 | 0.2422          |
-| 0.0129        | 19.0  | 3021 | 0.2430          |
-| 0.0129        | 20.0  | 3180 | 0.2471          |
 ### Framework versions

 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1839
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 159  | 0.1390          |
+| No log        | 2.0   | 318  | 0.1345          |
+| No log        | 3.0   | 477  | 0.1320          |
+| 0.1669        | 4.0   | 636  | 0.1397          |
+| 0.1669        | 5.0   | 795  | 0.1503          |
+| 0.1669        | 6.0   | 954  | 0.1546          |
+| 0.0764        | 7.0   | 1113 | 0.1623          |
+| 0.0764        | 8.0   | 1272 | 0.1694          |
+| 0.0764        | 9.0   | 1431 | 0.1789          |
+| 0.0518        | 10.0  | 1590 | 0.1839          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:858ad23a66e211aa7a1d6d2fda79eb21ed095777766810959f8c3b3c0fa930f5
 size 990345064

 version https://git-lfs.github.com/spec/v1
+oid sha256:c4ce9bdd7d79dec721338f31ecb7f4caed39b8a1ea6641bcfa7a4826fc52a76e
 size 990345064

runs/Jun03_10-19-58_iit-p/events.out.tfevents.1717390202.iit-p.5413.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d0f22cc065e02250e1438232431d283c3411669a94c6823d9d2a28f3d7f67d0b
+size 9544

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cae438fafc4353d19ef9d665b3fa14bc059fb086ee49be5df52bb6b559b01338
 size 4795

 version https://git-lfs.github.com/spec/v1
+oid sha256:39f9ed738bff14e79306f3207f2b2e0a183290507063f65953ffbffe5dab2918
 size 4795