nitsw
/

mistral_axonotll

Generated from Trainer

4-bit precision

Model card Files Files and versions Metrics Training metrics Community

nitsw commited on Jan 23, 2024

Commit

9226f63

·

verified ·

1 Parent(s): 6bbf591

End of training

Files changed (2) hide show

README.md +24 -5
adapter_model.bin +2 -2

README.md CHANGED Viewed

@@ -6,7 +6,7 @@ tags:
 - generated_from_trainer
 base_model: mistralai/Mistral-7B-v0.1
 model-index:
-- name: mistral_axonotl
   results: []
 ---
@@ -27,9 +27,9 @@ load_in_8bit: false
 load_in_4bit: true
 strict: false
-hub_model_id: nitsw/mistral_axonotl
 datasets:
-  - path: mhenrichsen/alpaca_2k_test
     type: alpaca
 dataset_prepared_path: last_run_prepared
 val_set_size: 0.1
@@ -56,7 +56,7 @@ lora_target_modules:
   - k_proj
   - o_proj
-wandb_project:
 wandb_entity:
 wandb_watch:
 wandb_name:
@@ -107,9 +107,11 @@ special_tokens:
 </details><br>
-# mistral_axonotl
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
 ## Model description
@@ -141,6 +143,23 @@ The following hyperparameters were used during training:
 ### Training results
 ### Framework versions

 - generated_from_trainer
 base_model: mistralai/Mistral-7B-v0.1
 model-index:
+- name: mistral_axonotll
   results: []
 ---
 load_in_4bit: true
 strict: false
+hub_model_id: nitsw/mistral_axonotll
 datasets:
+  - path: nitsw/alpaca_cleaned
     type: alpaca
 dataset_prepared_path: last_run_prepared
 val_set_size: 0.1
   - k_proj
   - o_proj
+wandb_project: swapnil_axolotl
 wandb_entity:
 wandb_watch:
 wandb_name:
 </details><br>
+# mistral_axonotll
 This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.8484
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.8523        | 0.06  | 10   | 0.8987          |
+| 0.8882        | 0.13  | 20   | 0.8766          |
+| 0.8374        | 0.19  | 30   | 0.8683          |
+| 0.8223        | 0.25  | 40   | 0.8636          |
+| 0.85          | 0.32  | 50   | 0.8604          |
+| 0.8425        | 0.38  | 60   | 0.8577          |
+| 0.8572        | 0.44  | 70   | 0.8560          |
+| 0.8427        | 0.51  | 80   | 0.8539          |
+| 0.8627        | 0.57  | 90   | 0.8526          |
+| 0.8242        | 0.63  | 100  | 0.8512          |
+| 0.8555        | 0.7   | 110  | 0.8501          |
+| 0.8348        | 0.76  | 120  | 0.8495          |
+| 0.8593        | 0.83  | 130  | 0.8488          |
+| 0.8403        | 0.89  | 140  | 0.8485          |
+| 0.8628        | 0.95  | 150  | 0.8484          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:be7988924d63fa823b4ec6a962a798fb2d6a3847f667cfb2f67f2079cf261253
-size 100143104

 version https://git-lfs.github.com/spec/v1
+oid sha256:30f3a3dae4afd97fd3a67fd4f5594a28dc463f575cb0a49a4e42ed6b0a0e64f3
+size 335705741