cnatale/Mistral-7B-Instruct-v0_1-Txt-2-Presto-SQL

Files changed (5) hide show

README.md CHANGED Viewed

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6492
 ## Model description
@@ -53,14 +53,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 1.3538        | 0.71  | 10   | 1.0781          |
-| 1.0194        | 1.43  | 20   | 0.8769          |
-| 0.8536        | 2.14  | 30   | 0.7753          |
-| 0.7679        | 2.86  | 40   | 0.7234          |
-| 0.7069        | 3.57  | 50   | 0.6825          |
-| 0.6551        | 4.29  | 60   | 0.6609          |
-| 0.6105        | 5.0   | 70   | 0.6500          |
-| 0.582         | 5.71  | 80   | 0.6492          |
 ### Framework versions

 This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6454
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.3516        | 0.71  | 10   | 1.0778          |
+| 1.0131        | 1.43  | 20   | 0.8707          |
+| 0.8446        | 2.14  | 30   | 0.7695          |
+| 0.7563        | 2.86  | 40   | 0.7202          |
+| 0.7009        | 3.57  | 50   | 0.6803          |
+| 0.6368        | 4.29  | 60   | 0.6585          |
+| 0.6201        | 5.0   | 70   | 0.6473          |
+| 0.5755        | 5.71  | 80   | 0.6454          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -19,8 +19,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM"
 }

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b8f26148e9e661fec0c4b657691cbcfb9733a0f254409201c25521480959bf8f
 size 109069176

 version https://git-lfs.github.com/spec/v1
+oid sha256:67cabdf2dbfcd4a924bb55b783d558b17698211420e7ace59932948718746a1f
 size 109069176

runs/Jan13_00-07-42_bbd6c80d4fe2/events.out.tfevents.1705104463.bbd6c80d4fe2.2665.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:e61ccca9fc897a1ccb2b3de2dee6c83eb62e7294583214062581fd91a03d0004
+size 8532

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4066029d7ea66d0b242cd1589af8c21eadd9bcab52c58156120d6fce0fe8a3c5
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:9975a8fe85e9f8d76aac5496d0755a70e0261aee255c85e2f67fe651415f3ab5
 size 4728