End of training

Browse files

Files changed (4) hide show

README.md +23 -14
model.safetensors +1 -1
runs/Jul07_11-51-34_Noah-Desktop/events.out.tfevents.1720371095.Noah-Desktop.21620.0 +2 -2
runs/Jul07_11-51-34_Noah-Desktop/events.out.tfevents.1720380841.Noah-Desktop.21620.1 +3 -0

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
-base_model: NowaBwagel0/llama-68m-oasst
 license: other
 tags:
 - generated_from_trainer
 model-index:
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [NowaBwagel0/llama-68m-oasst](https://huggingface.co/NowaBwagel0/llama-68m-oasst) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.4908
 ## Model description
@@ -42,21 +42,30 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 9
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 1.1068        | 0.9987 | 382  | 3.3211          |
-| 1.0575        | 2.0    | 765  | 3.3505          |
-| 1.0458        | 2.9987 | 1147 | 3.3795          |
-| 1.0294        | 4.0    | 1530 | 3.4092          |
-| 0.999         | 4.9987 | 1912 | 3.4321          |
-| 0.9847        | 6.0    | 2295 | 3.4528          |
-| 0.9233        | 6.9987 | 2677 | 3.4730          |
-| 0.9068        | 8.0    | 3060 | 3.4844          |
-| 0.9217        | 8.9882 | 3438 | 3.4908          |
 ### Framework versions

 ---
 license: other
+base_model: NowaBwagel0/llama-68m-oasst
 tags:
 - generated_from_trainer
 model-index:
 This model is a fine-tuned version of [NowaBwagel0/llama-68m-oasst](https://huggingface.co/NowaBwagel0/llama-68m-oasst) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.8987
 ## Model description
 - total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 18
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss |
+|:-------------:|:-------:|:----:|:---------------:|
+| 0.97          | 0.9987  | 382  | 3.4996          |
+| 0.9273        | 2.0     | 765  | 3.5370          |
+| 0.9176        | 2.9987  | 1147 | 3.5715          |
+| 0.9004        | 4.0     | 1530 | 3.6086          |
+| 0.8736        | 4.9987  | 1912 | 3.6379          |
+| 0.8599        | 6.0     | 2295 | 3.6761          |
+| 0.7955        | 6.9987  | 2677 | 3.7044          |
+| 0.7741        | 8.0     | 3060 | 3.7346          |
+| 0.7364        | 8.9987  | 3442 | 3.7615          |
+| 0.7605        | 10.0    | 3825 | 3.7855          |
+| 0.695         | 10.9987 | 4207 | 3.8088          |
+| 0.7111        | 12.0    | 4590 | 3.8332          |
+| 0.6849        | 12.9987 | 4972 | 3.8490          |
+| 0.6862        | 14.0    | 5355 | 3.8659          |
+| 0.6834        | 14.9987 | 5737 | 3.8785          |
+| 0.6541        | 16.0    | 6120 | 3.8898          |
+| 0.646         | 16.9987 | 6502 | 3.8961          |
+| 0.6777        | 17.9765 | 6876 | 3.8987          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aaf875e36658089a60ad3e8da039a9e1ed7c9f1e46355956e35fd1afac8d25b9
 size 272123144

 version https://git-lfs.github.com/spec/v1
+oid sha256:b527f61bd65e7a2895b0da8e4c2e292d9c4f082fc19099ce6bf51534fc020ca2
 size 272123144

runs/Jul07_11-51-34_Noah-Desktop/events.out.tfevents.1720371095.Noah-Desktop.21620.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b8fe0eb1403ce04f635d8c5a50e504178a3d082e2c66b31ee104229dd71dbac1
-size 180508

 version https://git-lfs.github.com/spec/v1
+oid sha256:51bde0d9379b1636d9aebe0ec6c1cee6e366a47f7881952f89404e94120bf711
+size 191321

runs/Jul07_11-51-34_Noah-Desktop/events.out.tfevents.1720380841.Noah-Desktop.21620.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:08f042ac002bdd834ef2816b31c9d0b24e8102d6ae8a7f2d6681fba71b71bbdf
+size 359