DeepDream2045
/

65dfc289-bfac-4c47-814e-89cc9b3db974

Generated from Trainer

Model card Files Files and versions Community

DeepDream2045 commited on Dec 13, 2024

Commit

3abfbc1

·

verified ·

1 Parent(s): 77b9e0a

End of training

Files changed (2) hide show

README.md +4 -4
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -98,7 +98,7 @@ xformers_attention: true
 This model is a fine-tuned version of [NousResearch/Yarn-Mistral-7b-64k](https://huggingface.co/NousResearch/Yarn-Mistral-7b-64k) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.1861
 ## Model description
@@ -135,9 +135,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 32.5319       | 0.0004 | 1    | 2.2997          |
-| 45.1928       | 0.0088 | 25   | 2.1923          |
-| 45.6105       | 0.0176 | 50   | 2.1861          |
 ### Framework versions

 This model is a fine-tuned version of [NousResearch/Yarn-Mistral-7b-64k](https://huggingface.co/NousResearch/Yarn-Mistral-7b-64k) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1862
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 32.5311       | 0.0004 | 1    | 2.2997          |
+| 45.2215       | 0.0088 | 25   | 2.1925          |
+| 45.6407       | 0.0176 | 50   | 2.1862          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:abffa9b619c8db3bcba2ab08587557f87865398b50856ef9bf665d77c2e76682
 size 335706186

 version https://git-lfs.github.com/spec/v1
+oid sha256:78bd26063133246ba38d2644d40a3136e2e23858f6e18281d1fbbeeeb094bff2
 size 335706186