Ehraim commited on
Commit
e155d7c
·
1 Parent(s): fa34806

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.5829
19
 
20
  ## Model description
21
 
@@ -49,8 +49,8 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | No log | 0.97 | 25 | 0.6393 |
53
- | No log | 1.94 | 50 | 0.5829 |
54
 
55
 
56
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.6134
19
 
20
  ## Model description
21
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | No log | 0.97 | 25 | 0.6735 |
53
+ | No log | 1.94 | 50 | 0.6134 |
54
 
55
 
56
  ### Framework versions
adapter_config.json CHANGED
@@ -19,11 +19,11 @@
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
- "o_proj",
23
  "q_proj",
 
24
  "k_proj",
25
- "gate_proj",
26
- "v_proj"
27
  ],
28
  "task_type": "CAUSAL_LM"
29
  }
 
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
 
22
  "q_proj",
23
+ "o_proj",
24
  "k_proj",
25
+ "v_proj",
26
+ "gate_proj"
27
  ],
28
  "task_type": "CAUSAL_LM"
29
  }
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:838876c8a68b6a64539aa50fec811f85730cca94c598c518ee9203ee484b52a5
3
  size 92317600
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af5110712fa8a96d39557f4cd639354261686675654b9a7dc06f8dcd0c1cc0b1
3
  size 92317600
runs/Dec01_18-58-31_0625fec9c590/events.out.tfevents.1701457126.0625fec9c590.1001.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7c2ebad4ecc3fc7bd56be6dd3cb57a8a6367d8346a06b51286ac2dda7c194a4c
3
+ size 4611
runs/Dec01_19-15-50_0625fec9c590/events.out.tfevents.1701458171.0625fec9c590.1001.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a50e1f0238d4ba3289e76836f839cc3793543ac9d3cde4c55db154bfc55f6b72
3
+ size 4611
runs/Dec01_19-26-42_afc9af6b3b79/events.out.tfevents.1701458818.afc9af6b3b79.5965.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bd29c66b86e3809e779618c40f03ec8015a0d9b75503e1db89daf2b87e62edba
3
+ size 5491
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ec1ee81c415ac93ec9f2ce45ca4cfec483db096c1b02af0bd76bfb738600adf2
3
  size 4600
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b4914981eeeb28cdc048efac10b994db593cdfb96c6a74207252ee0aa57187a
3
  size 4600