End of training

Browse files

Files changed (2) hide show

README.md +23 -14
runs/May10_12-00-23_186255fb755c/events.out.tfevents.1715343240.186255fb755c.10006.1 +3 -0

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3976
 ## Model description
@@ -44,23 +44,32 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 4.1739        | 0.9412 | 4    | 3.4659          |
-| 4.0195        | 1.8824 | 8    | 3.3893          |
-| 3.2322        | 2.8235 | 12   | 3.2363          |
-| 2.0625        | 4.0    | 17   | 3.0555          |
-| 2.2884        | 4.9412 | 21   | 2.9109          |
-| 2.1576        | 5.8824 | 25   | 2.7795          |
-| 2.0546        | 6.8235 | 29   | 2.6220          |
-| 1.5652        | 8.0    | 34   | 2.4955          |
-| 1.8893        | 8.9412 | 38   | 2.4207          |
-| 1.6427        | 9.4118 | 40   | 2.3976          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8239
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss |
+|:-------------:|:-------:|:----:|:---------------:|
+| 4.1739        | 0.9412  | 4    | 3.4659          |
+| 4.0196        | 1.8824  | 8    | 3.3864          |
+| 3.2283        | 2.8235  | 12   | 3.2071          |
+| 2.0147        | 4.0     | 17   | 2.9738          |
+| 2.2299        | 4.9412  | 21   | 2.7751          |
+| 2.0711        | 5.8824  | 25   | 2.6203          |
+| 1.946         | 6.8235  | 29   | 2.3797          |
+| 1.4262        | 8.0     | 34   | 2.0694          |
+| 1.6468        | 8.9412  | 38   | 1.8468          |
+| 1.5549        | 9.8824  | 42   | 1.5932          |
+| 1.4661        | 10.8235 | 46   | 1.3819          |
+| 1.1079        | 12.0    | 51   | 1.2555          |
+| 1.3114        | 12.9412 | 55   | 1.1306          |
+| 1.2436        | 13.8824 | 59   | 1.0515          |
+| 1.1965        | 14.8235 | 63   | 0.9581          |
+| 0.9269        | 16.0    | 68   | 0.9277          |
+| 1.1262        | 16.9412 | 72   | 0.8709          |
+| 1.1054        | 17.8824 | 76   | 0.8343          |
+| 0.8664        | 18.8235 | 80   | 0.8239          |
 ### Framework versions

runs/May10_12-00-23_186255fb755c/events.out.tfevents.1715343240.186255fb755c.10006.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:846cbb594d2e0b2a92355419c4361f89f2b7b936532ed7e646e88a5e90f8d2d9
+size 354