End of training

Browse files

Files changed (4) hide show

README.md +30 -13
model.safetensors +1 -1
runs/Jun03_05-16-53_9a93aa0ad6d9/events.out.tfevents.1717391815.9a93aa0ad6d9.730.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,8 @@
 ---
 license: mit
 tags:
 - generated_from_trainer
-base_model: Aravindan/gpt2out
 model-index:
 - name: gpt2coder-8epochs
   results: []
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Aravindan/gpt2out](https://huggingface.co/Aravindan/gpt2out) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9718
 ## Model description
@@ -43,20 +43,37 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 8
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 125  | 3.1181          |
-| No log        | 2.0   | 250  | 2.6411          |
-| No log        | 3.0   | 375  | 2.4035          |
-| 2.9711        | 4.0   | 500  | 2.2375          |
-| 2.9711        | 5.0   | 625  | 2.1289          |
-| 2.9711        | 6.0   | 750  | 2.0475          |
-| 2.9711        | 7.0   | 875  | 1.9931          |
-| 2.1959        | 8.0   | 1000 | 1.9718          |
 ### Framework versions

 ---
 license: mit
+base_model: Aravindan/gpt2out
 tags:
 - generated_from_trainer
 model-index:
 - name: gpt2coder-8epochs
   results: []
 This model is a fine-tuned version of [Aravindan/gpt2out](https://huggingface.co/Aravindan/gpt2out) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9270
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 25
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss |
+|:-------------:|:-------:|:----:|:---------------:|
+| No log        | 0.9810  | 31   | 3.2508          |
+| No log        | 1.9937  | 63   | 2.6920          |
+| No log        | 2.9747  | 94   | 2.3769          |
+| No log        | 3.9873  | 126  | 2.1444          |
+| No log        | 5.0     | 158  | 1.9673          |
+| No log        | 5.9810  | 189  | 1.8320          |
+| No log        | 6.9937  | 221  | 1.7097          |
+| No log        | 7.9747  | 252  | 1.6159          |
+| No log        | 8.9873  | 284  | 1.5231          |
+| No log        | 10.0    | 316  | 1.4535          |
+| No log        | 10.9810 | 347  | 1.3788          |
+| No log        | 11.9937 | 379  | 1.3109          |
+| No log        | 12.9747 | 410  | 1.2496          |
+| No log        | 13.9873 | 442  | 1.1989          |
+| No log        | 14.9810 | 465  | 1.1647          |
+| No log        | 15.9937 | 497  | 1.1208          |
+| 1.3856        | 16.9747 | 528  | 1.0841          |
+| 1.3856        | 17.9873 | 560  | 1.0464          |
+| 1.3856        | 19.0    | 592  | 1.0180          |
+| 1.3856        | 19.9810 | 623  | 0.9928          |
+| 1.3856        | 20.9937 | 655  | 0.9689          |
+| 1.3856        | 21.9747 | 686  | 0.9517          |
+| 1.3856        | 22.9873 | 718  | 0.9390          |
+| 1.3856        | 24.0    | 750  | 0.9298          |
+| 1.3856        | 24.7911 | 775  | 0.9270          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a1be145ec86717b1f33f14ab714d56a62366e204342bda2d6f6af94ceae0e1ec
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:d29ced89a6423e2315431bff8b01bf9452b60607d050fb307ccb939435a8470b
 size 497774208

runs/Jun03_05-16-53_9a93aa0ad6d9/events.out.tfevents.1717391815.9a93aa0ad6d9.730.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:084bc96bc5cb4664c3485bd3ff3d7ef14dee60c8bec873d09f34654eca785ff5
+size 8616

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6e9096652f666a5a9037023053ae1fa3ff8e596df2b1c46d324831ec3b6db17f
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:08ead8923aaaf05f683a0f272f9e8c5877c5d68113c2f6f34fbc69f833fffcc0
 size 5112