cnatale commited on
Commit
c6db32f
1 Parent(s): 4a696d6

cnatale/Mistral-7B-Instruct-v0_1-Txt-2-Presto-SQL

Browse files
README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.6492
24
 
25
  ## Model description
26
 
@@ -53,14 +53,14 @@ The following hyperparameters were used during training:
53
 
54
  | Training Loss | Epoch | Step | Validation Loss |
55
  |:-------------:|:-----:|:----:|:---------------:|
56
- | 1.3538 | 0.71 | 10 | 1.0781 |
57
- | 1.0194 | 1.43 | 20 | 0.8769 |
58
- | 0.8536 | 2.14 | 30 | 0.7753 |
59
- | 0.7679 | 2.86 | 40 | 0.7234 |
60
- | 0.7069 | 3.57 | 50 | 0.6825 |
61
- | 0.6551 | 4.29 | 60 | 0.6609 |
62
- | 0.6105 | 5.0 | 70 | 0.6500 |
63
- | 0.582 | 5.71 | 80 | 0.6492 |
64
 
65
 
66
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.6454
24
 
25
  ## Model description
26
 
 
53
 
54
  | Training Loss | Epoch | Step | Validation Loss |
55
  |:-------------:|:-----:|:----:|:---------------:|
56
+ | 1.3516 | 0.71 | 10 | 1.0778 |
57
+ | 1.0131 | 1.43 | 20 | 0.8707 |
58
+ | 0.8446 | 2.14 | 30 | 0.7695 |
59
+ | 0.7563 | 2.86 | 40 | 0.7202 |
60
+ | 0.7009 | 3.57 | 50 | 0.6803 |
61
+ | 0.6368 | 4.29 | 60 | 0.6585 |
62
+ | 0.6201 | 5.0 | 70 | 0.6473 |
63
+ | 0.5755 | 5.71 | 80 | 0.6454 |
64
 
65
 
66
  ### Framework versions
adapter_config.json CHANGED
@@ -19,8 +19,8 @@
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
- "q_proj",
23
- "v_proj"
24
  ],
25
  "task_type": "CAUSAL_LM"
26
  }
 
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
+ "v_proj",
23
+ "q_proj"
24
  ],
25
  "task_type": "CAUSAL_LM"
26
  }
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b8f26148e9e661fec0c4b657691cbcfb9733a0f254409201c25521480959bf8f
3
  size 109069176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:67cabdf2dbfcd4a924bb55b783d558b17698211420e7ace59932948718746a1f
3
  size 109069176
runs/Jan13_00-07-42_bbd6c80d4fe2/events.out.tfevents.1705104463.bbd6c80d4fe2.2665.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e61ccca9fc897a1ccb2b3de2dee6c83eb62e7294583214062581fd91a03d0004
3
+ size 8532
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4066029d7ea66d0b242cd1589af8c21eadd9bcab52c58156120d6fce0fe8a3c5
3
  size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9975a8fe85e9f8d76aac5496d0755a70e0261aee255c85e2f67fe651415f3ab5
3
  size 4728