End of training

Files changed (4)

README.md +21 -22
model.safetensors +1 -1
runs/May21_17-01-33_04500254204e/events.out.tfevents.1716310898.04500254204e.34.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.3524
+- Loss: 3.6530
 ## Model description
@@ -34,7 +34,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0005
+- learning_rate: 0.0001
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -48,26 +48,25 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 0.98  | 6    | 8.2912          |
-| 9.0416        | 1.96  | 12   | 6.3294          |
-| 9.0416        | 2.94  | 18   | 5.7787          |
-| 6.7966        | 3.92  | 24   | 5.4608          |
-| 6.5563        | 4.9   | 30   | 5.3249          |
-| 6.5563        | 5.88  | 36   | 5.1962          |
-| 6.2756        | 6.86  | 42   | 5.1209          |
-| 6.2756        | 8.0   | 49   | 4.9701          |
-| 6.3126        | 8.98  | 55   | 4.8793          |
-| 6.237         | 9.96  | 61   | 4.7837          |
-| 6.237         | 10.94 | 67   | 4.7102          |
-| 5.9722        | 11.92 | 73   | 4.5721          |
-| 5.9722        | 12.9  | 79   | 4.5170          |
-| 5.9883        | 13.88 | 85   | 4.4562          |
-| 5.8828        | 14.86 | 91   | 4.4168          |
-| 5.8828        | 16.0  | 98   | 4.3880          |
-| 5.8493        | 16.98 | 104  | 4.3684          |
-| 5.8112        | 17.96 | 110  | 4.3570          |
-| 5.8112        | 18.94 | 116  | 4.3528          |
-| 5.6628        | 19.59 | 120  | 4.3524          |
+| No log        | 0.93  | 7    | 8.2816          |
+| 9.7888        | 2.0   | 15   | 6.6539          |
+| 8.253         | 2.93  | 22   | 5.9839          |
+| 7.4098        | 4.0   | 30   | 5.5296          |
+| 7.4098        | 4.93  | 37   | 5.1792          |
+| 6.6836        | 6.0   | 45   | 4.8581          |
+| 6.2698        | 6.93  | 52   | 4.6282          |
+| 5.8092        | 8.0   | 60   | 4.4243          |
+| 5.8092        | 8.93  | 67   | 4.2668          |
+| 5.3803        | 10.0  | 75   | 4.1214          |
+| 5.3501        | 10.93 | 82   | 4.0024          |
+| 5.1278        | 12.0  | 90   | 3.8835          |
+| 5.1278        | 12.93 | 97   | 3.8106          |
+| 4.9471        | 14.0  | 105  | 3.7422          |
+| 4.9279        | 14.93 | 112  | 3.7098          |
+| 4.8129        | 16.0  | 120  | 3.6740          |
+| 4.8129        | 16.93 | 127  | 3.6601          |
+| 4.7258        | 18.0  | 135  | 3.6535          |
+| 4.8643        | 18.67 | 140  | 3.6530          |
 ### Framework versions
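
Reviewer note: the two final validation losses in this README diff are easier to compare as perplexities. The sketch below assumes the reported loss is the mean per-token cross-entropy in nats, which is the usual convention for causal language model fine-tuning but is not stated in the README. The function and variable names are illustrative only.

```python
import math


def perplexity(cross_entropy_loss: float) -> float:
    """Convert a mean per-token cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(cross_entropy_loss)


# Final validation losses taken from the README diff.
old_loss = 4.3524  # previous run, learning_rate 0.0005
new_loss = 3.6530  # this commit, learning_rate 0.0001

old_ppl = perplexity(old_loss)
new_ppl = perplexity(new_loss)

print(f"old perplexity: {old_ppl:.2f}")
print(f"new perplexity: {new_ppl:.2f}")
print(f"ratio new/old:  {new_ppl / old_ppl:.3f}")
```

Under that assumption, a loss drop of about 0.70 nats corresponds to roughly halving the perplexity, since the ratio is `exp(new_loss - old_loss)`.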

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b672c8c562af88d3224a2bbb5365350a8a62de023117f2d96c094b72b9299da4
+oid sha256:6cdecdb980e769385dcf2d23c05a8a603b54635cf445984d61a3614e0acf77bc
 size 497777280

runs/May21_17-01-33_04500254204e/events.out.tfevents.1716310898.04500254204e.34.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3588bd4aa508e71cd922ca91148c9004d50631c387d9c80d1714988152ba1b36
+size 17926

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7d814d2398436b0823107338673b666c0f55a1ce0d76410bc11af67edf461fe6
+oid sha256:eba9042156f687078e30db9d1e502d3bab2ab4e1fb58bc65f1be38afd512ad73
 size 4920
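
Reviewer note: the three binary files in this commit are stored as Git LFS pointers, so the diff only shows a changed `oid` with an unchanged `size`. The following is a minimal sketch of reading such a pointer, assuming the three-line `version` / `oid` / `size` layout seen above. `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file into a dict.

    Assumes each non-empty line is '<key> <value>' and that the oid
    is written as '<hash_algo>:<hex_digest>'.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value

    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# New pointer for model.safetensors, copied from the diff.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:6cdecdb980e769385dcf2d23c05a8a603b54635cf445984d61a3614e0acf77bc
size 497777280
"""

pointer = parse_lfs_pointer(pointer_text)
print(pointer["hash_algo"], pointer["digest"][:12], pointer["size"])
```

Comparing the parsed `digest` before and after a commit is enough to tell that the weights changed, even though the `size` is identical.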