End of training

Files changed:
- README.md +15 -1
- adapter_model.bin +3 -0
README.md
CHANGED
@@ -2,6 +2,7 @@
 license: apache-2.0
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -88,7 +89,9 @@ weight_decay: 0.0
 
 # special-token-all-linear
 
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.0801
 
 ## Model description
 
@@ -121,6 +124,17 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 4
 
+### Training results
+
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 2.1829        | 0.01  | 1    | 2.1038          |
+| 0.091         | 0.8   | 151  | 0.0832          |
+| 0.0741        | 1.58  | 302  | 0.0801          |
+| 0.0687        | 2.36  | 453  | 0.0801          |
+| 0.0654        | 3.14  | 604  | 0.0801          |
+
+
 ### Framework versions
 
 - PEFT 0.9.0
|
adapter_model.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:c7dbc37d9ba8754f50c194b15eddf83d86fb7118c5b9a827bf71806f2d3eb8af
+size 1938497058
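The three added lines above are not the adapter weights themselves but a Git LFS pointer file: the real ~1.9 GB binary is stored in LFS and fetched on checkout. A minimal sketch of reading such a pointer, using the exact text from this commit (the `parse_lfs_pointer` helper is a hypothetical illustration, not part of git-lfs or any tool here):

```python
# Parse a Git LFS pointer file into its key/value fields.
# Each line of a pointer is "key value"; the pointer text below
# is copied verbatim from the adapter_model.bin hunk above.

def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line of an LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:c7dbc37d9ba8754f50c194b15eddf83d86fb7118c5b9a827bf71806f2d3eb8af
size 1938497058
"""

fields = parse_lfs_pointer(pointer)
print(fields["oid"])   # the sha256 digest of the stored adapter binary
print(fields["size"])  # adapter size in bytes (~1.9 GB)
```

The `size` field lets you check disk space before pulling the object, and the `oid` digest can be used to verify the downloaded file's integrity.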