shadow0017/shawgpt-ft

Files changed (3)

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/Mistral-7B-Instruct-v0.2-GPTQ](https://huggingface.co/TheBloke/Mistral-7B-Instruct-v0.2-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.7347
+- Loss: 1.7259
 ## Model description
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 4.5922        | 0.9231 | 3    | 3.9647          |
-| 4.0407        | 1.8462 | 6    | 3.4355          |
-| 3.4539        | 2.7692 | 9    | 2.9726          |
-| 2.235         | 4.0    | 13   | 2.5286          |
-| 2.6285        | 4.9231 | 16   | 2.2706          |
-| 2.3624        | 5.8462 | 19   | 2.1297          |
-| 2.1106        | 6.7692 | 22   | 1.9465          |
-| 1.4567        | 8.0    | 26   | 1.8036          |
-| 1.8325        | 8.9231 | 29   | 1.7450          |
-| 1.2682        | 9.2308 | 30   | 1.7347          |
+| 4.593         | 0.9231 | 3    | 3.9679          |
+| 4.0435        | 1.8462 | 6    | 3.4369          |
+| 3.4545        | 2.7692 | 9    | 2.9737          |
+| 2.24          | 4.0    | 13   | 2.5341          |
+| 2.6225        | 4.9231 | 16   | 2.2679          |
+| 2.325         | 5.8462 | 19   | 2.1317          |
+| 2.1105        | 6.7692 | 22   | 1.9456          |
+| 1.4489        | 8.0    | 26   | 1.7923          |
+| 1.8142        | 8.9231 | 29   | 1.7353          |
+| 1.2582        | 9.2308 | 30   | 1.7259          |
 ### Framework versions

runs/Nov24_11-34-18_c067d9f4437a/events.out.tfevents.1732448061.c067d9f4437a.613.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:2c3100240fcf2b49cd0c9bcbf0596f7a55c3947626f57a9c74073d787d3962bf
+size 10800

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1a4051300dd4781e03390a55850e190d0e28b2262f2e3d0f4487f1e9cab290d3
+oid sha256:79b142f16ea9d1fabb13f5bcbcdd9fedd537595934e1156cc7a6291f9d00c36a
 size 5240