End of training

Files changed (7) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 6.9819
 ## Model description
@@ -42,20 +42,25 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- num_epochs: 2
-- mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.8649 | 4    | 7.5810          |
-| No log        | 1.7297 | 8    | 6.9819          |
 ### Framework versions
-- Transformers 4.40.2
-- Pytorch 2.2.1+cu121
-- Datasets 2.19.1
-- Tokenizers 0.19.1

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 6.4423
 ## Model description
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- num_epochs: 8
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| No log        | 0.96  | 3    | 7.7950          |
+| No log        | 1.92  | 6    | 6.9879          |
+| No log        | 2.88  | 9    | 6.6631          |
+| 7.658         | 3.84  | 12   | 6.5423          |
+| 7.658         | 4.8   | 15   | 6.4882          |
+| 7.658         | 5.76  | 18   | 6.4637          |
+| 6.5857        | 6.72  | 21   | 6.4457          |
+| 6.5857        | 7.68  | 24   | 6.4423          |
 ### Framework versions
+- Transformers 4.39.3
+- Pytorch 2.1.2
+- Datasets 2.18.0
+- Tokenizers 0.15.2

config.json CHANGED Viewed

@@ -33,7 +33,7 @@
     }
   },
   "torch_dtype": "float32",
-  "transformers_version": "4.40.2",
   "use_cache": true,
   "vocab_size": 50258
 }

     }
   },
   "torch_dtype": "float32",
+  "transformers_version": "4.39.3",
   "use_cache": true,
   "vocab_size": 50258
 }

generation_config.json CHANGED Viewed

@@ -2,5 +2,5 @@
   "_from_model_config": true,
   "bos_token_id": 50256,
   "eos_token_id": 50256,
-  "transformers_version": "4.40.2"
 }

   "_from_model_config": true,
   "bos_token_id": 50256,
   "eos_token_id": 50256,
+  "transformers_version": "4.39.3"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6bd9a3eebead594209b9d8818a947bb990f46a3ce61f2d35b98ed8110be8f30a
 size 497777280

 version https://git-lfs.github.com/spec/v1
+oid sha256:933699a9b7c2f094e47eeccbaf9a51566ed49e2e27c25012e374b0123bcc1316
 size 497777280

runs/May21_08-11-04_e261ca047b2e/events.out.tfevents.1716279078.e261ca047b2e.34.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:19a9ff1ee65e09ee4641fa12185d27508832b51e105de6997b38d5f9bcba6d97
+size 10480

runs/May21_08-13-47_e261ca047b2e/events.out.tfevents.1716279233.e261ca047b2e.34.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:27aba0d86fef8b9e26cf4776ba1c8c65b65b3cc2734f970b191c5d1327e2b6b6
+size 12490

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7dece0d76e32e34ffbcfa4e84382a962095709d6be483fba9b728b08e9e5f536
-size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:1fd184e8ac67157d06e0ba504579693b3ef687794176186fc2d9a4a5615154b5
+size 4920