Promptengineering/mistral-instruct-generation_updated

Files changed (6)

README.md CHANGED

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1519
+- Loss: 0.7713
 ## Model description
@@ -46,22 +46,21 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_steps: 0.03
-- training_steps: 200
+- training_steps: 180
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.1307        | 1.11  | 20   | 1.0842          |
-| 0.6826        | 2.22  | 40   | 0.6897          |
-| 0.4163        | 3.33  | 60   | 0.3940          |
-| 0.2125        | 4.44  | 80   | 0.2545          |
-| 0.1186        | 5.56  | 100  | 0.1905          |
-| 0.085         | 6.67  | 120  | 0.1641          |
-| 0.0693        | 7.78  | 140  | 0.1562          |
-| 0.0604        | 8.89  | 160  | 0.1516          |
-| 0.0516        | 10.0  | 180  | 0.1436          |
-| 0.044         | 11.11 | 200  | 0.1519          |
+| 0.7443        | 1.11  | 20   | 0.7713          |
+| 0.7413        | 2.22  | 40   | 0.7713          |
+| 0.7476        | 3.33  | 60   | 0.7713          |
+| 0.753         | 4.44  | 80   | 0.7713          |
+| 0.7514        | 5.56  | 100  | 0.7713          |
+| 0.7383        | 6.67  | 120  | 0.7713          |
+| 0.7434        | 7.78  | 140  | 0.7713          |
+| 0.7497        | 8.89  | 160  | 0.7713          |
+| 0.7644        | 10.0  | 180  | 0.7713          |
 ### Framework versions
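
In the updated results table the validation loss is 0.7713 at every logged step, while the training loss stays between roughly 0.74 and 0.76. The following is a minimal sketch of how such a table could be checked automatically for a flat validation loss. The function names `parse_results_table` and `validation_loss_is_flat` are hypothetical helpers introduced here for illustration; they are not part of the model card tooling.

```python
# Sketch only: parse a model-card results table and flag a constant
# validation loss. Helper names are hypothetical, not from any library.

NEW_TABLE = """
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:----:|:---------------:|
| 0.7443        | 1.11  | 20   | 0.7713          |
| 0.7413        | 2.22  | 40   | 0.7713          |
| 0.7476        | 3.33  | 60   | 0.7713          |
| 0.753         | 4.44  | 80   | 0.7713          |
| 0.7514        | 5.56  | 100  | 0.7713          |
| 0.7383        | 6.67  | 120  | 0.7713          |
| 0.7434        | 7.78  | 140  | 0.7713          |
| 0.7497        | 8.89  | 160  | 0.7713          |
| 0.7644        | 10.0  | 180  | 0.7713          |
"""


def parse_results_table(markdown: str) -> list[dict]:
    """Return one dict per numeric row of a four-column results table."""
    rows = []
    for line in markdown.splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 4:
            continue  # not a table row
        try:
            row = {
                "training_loss": float(cells[0]),
                "epoch": float(cells[1]),
                "step": int(cells[2]),
                "validation_loss": float(cells[3]),
            }
        except ValueError:
            continue  # header or alignment row
        rows.append(row)
    return rows


def validation_loss_is_flat(rows: list[dict], tol: float = 1e-9) -> bool:
    """True when every logged validation loss is the same within tol."""
    values = [r["validation_loss"] for r in rows]
    return len(values) > 1 and max(values) - min(values) <= tol


rows = parse_results_table(NEW_TABLE)
print(len(rows), validation_loss_is_flat(rows))
```

A flat validation loss is only a signal worth investigating; the diff itself does not say why the values are identical.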

adapter_config.json CHANGED

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_rslora": false
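
The only change in this file is the order of the two entries in `target_modules`. A minimal sketch of comparing the two versions while ignoring list order is shown below. It assumes that the order of `target_modules` carries no meaning, which is an assumption made here and not something the diff states; `same_target_modules` is a hypothetical helper.

```python
import json

# The two versions of the relevant fragment, taken from the diff above.
old_cfg = json.loads('{"target_modules": ["q_proj", "v_proj"]}')
new_cfg = json.loads('{"target_modules": ["v_proj", "q_proj"]}')


def same_target_modules(a: dict, b: dict) -> bool:
    """Compare target_modules as sets (assumes order is not significant)."""
    return set(a["target_modules"]) == set(b["target_modules"])


print(old_cfg["target_modules"] == new_cfg["target_modules"])  # list comparison
print(same_target_modules(old_cfg, new_cfg))                   # set comparison
```

Under that assumption the two configurations target the same modules, so this hunk would not change which layers the adapter applies to.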

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7522f58b71980000b6dc878762ca1909d145175d4adbb5019ed870aa7c874856
-size 109069176
+oid sha256:e44ce263e6fd885f50d82ca515b9325375b43ee36ededb75acf161ce88bc2e41
+size 48
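
The pointer for `adapter_model.safetensors` drops from 109069176 bytes to 48 bytes. A 48-byte file cannot hold weights comparable to the earlier one, so the new adapter file may be effectively empty; that reading is an inference from the sizes, not something the diff confirms. The sketch below parses the two Git LFS pointer bodies and compares their sizes. `parse_lfs_pointer` is a hypothetical helper written for this illustration.

```python
# Sketch only: read the key/value lines of a Git LFS pointer file.

OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:7522f58b71980000b6dc878762ca1909d145175d4adbb5019ed870aa7c874856
size 109069176
"""

NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:e44ce263e6fd885f50d82ca515b9325375b43ee36ededb75acf161ce88bc2e41
size 48
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split each 'key value' line and return the parsed fields."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
print(old["size"], new["size"], new["size"] < old["size"])
```

A check like this could run before pushing, so that an unexpectedly small weights file is noticed early.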

runs/Feb11_20-21-19_ab6695f28bb2/events.out.tfevents.1707682894.ab6695f28bb2.1842.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:feb065ac8b33f4a20333f322485a8da33904ac0ada7dc8235a5c75a40a22e24c
+size 8001

runs/Feb11_20-30-25_ab6695f28bb2/events.out.tfevents.1707683432.ab6695f28bb2.1842.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:0ae23d4086ae985c16acc64b667af40979657f845ab7e42ca86d201cbf9770be
+size 10336

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:869689526706ca2d6ec02fc679c66aa0b1768a5077f1208ea0270d568579b091
+oid sha256:f621554be607a891ff3f8d52ab2b5608c0ac5dadb6bce7262c9d4952a0629489
 size 4728