cwaud
/

test

Generated from Trainer

8-bit precision

Model card Files Files and versions Community

cwaud commited on Oct 6

Commit

1e81a9a

•

1 Parent(s): 32fc121

End of training

Files changed (2) hide show

README.md +5 -5
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -90,7 +90,7 @@ xformers_attention: null
 This model is a fine-tuned version of [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0050
 ## Model description
@@ -124,10 +124,10 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 4.8197        | 0.0042 | 1    | 4.6394          |
-| 4.6489        | 0.0126 | 3    | 4.5547          |
-| 4.0712        | 0.0253 | 6    | 2.9871          |
-| 1.3689        | 0.0379 | 9    | 1.0050          |
 ### Framework versions

 This model is a fine-tuned version of [unsloth/Llama-3.2-3B-Instruct](https://huggingface.co/unsloth/Llama-3.2-3B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.1166
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 5.2097        | 0.0042 | 1    | 5.1479          |
+| 5.2103        | 0.0126 | 3    | 5.0747          |
+| 4.1538        | 0.0253 | 6    | 3.1044          |
+| 1.3549        | 0.0379 | 9    | 1.1166          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a0f5480285728eabc395dd7b2f6aa5f5e8e8ad30c8acb9171dd1a5c5b6cae604
 size 982663982

 version https://git-lfs.github.com/spec/v1
+oid sha256:8145ec4a4b61a66c658b1b9d738a3be4010db6dcccac967c6668e60dc7ef2cf3
 size 982663982