liuylhf
/

mixtral-lora-less-modules

Generated from Trainer

4-bit precision

Model card Files Files and versions Community

liuylhf commited on Feb 29

Commit

7c01285

•

1 Parent(s): 8a37e5d

End of training

Files changed (2) hide show

README.md +20 -1
adapter_model.bin +2 -2

README.md CHANGED Viewed

@@ -2,6 +2,7 @@
 license: apache-2.0
 library_name: peft
 tags:
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
@@ -111,7 +112,9 @@ fsdp_config:
 # mixtral-lora-less-modules
-This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on an unknown dataset.
 ## Model description
@@ -144,6 +147,22 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 1
 ### Framework versions
 - PEFT 0.8.2

 license: apache-2.0
 library_name: peft
 tags:
+- axolotl
 - generated_from_trainer
 base_model: mistralai/Mixtral-8x7B-Instruct-v0.1
 model-index:
 # mixtral-lora-less-modules
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1911
 ## Model description
 - lr_scheduler_warmup_steps: 10
 - num_epochs: 1
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 3.2966        | 0.0   | 1    | 3.2222          |
+| 0.261         | 0.1   | 31   | 0.2720          |
+| 0.1428        | 0.2   | 62   | 0.2252          |
+| 0.2674        | 0.3   | 93   | 0.2108          |
+| 0.1767        | 0.4   | 124  | 0.2043          |
+| 0.105         | 0.5   | 155  | 0.2003          |
+| 0.1799        | 0.6   | 186  | 0.1958          |
+| 0.1528        | 0.7   | 217  | 0.1942          |
+| 0.1954        | 0.8   | 248  | 0.1917          |
+| 0.1821        | 0.9   | 279  | 0.1911          |
 ### Framework versions
 - PEFT 0.8.2

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a1d3a819fc09188e77e001a02e4d76369078a73386bcd5cc3d3b6c0d16f05425
-size 969596450

 version https://git-lfs.github.com/spec/v1
+oid sha256:f3db019c05048d9178cfadcc93e13bfb87e58e8b2a7d7b2df5ca10a9014b162f
+size 218196746