Ai-Marshal committed on
Commit
fd09bfa
1 Parent(s): a7851e5

Ai-Marshal/12.5k_A100_5epoch_adapter

README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [mistralai/Mixtral-8x7B-Instruct-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1) on the generator dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.5052
+ - Loss: 0.5049
 
  ## Model description
 
@@ -52,11 +52,11 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.4994 | 1.0 | 559 | 0.5309 |
- | 0.5643 | 2.0 | 1118 | 0.5145 |
- | 0.5366 | 3.0 | 1677 | 0.5087 |
- | 0.4885 | 4.0 | 2236 | 0.5059 |
- | 0.489 | 5.0 | 2795 | 0.5052 |
+ | 0.5879 | 1.0 | 559 | 0.5304 |
+ | 0.5272 | 2.0 | 1118 | 0.5142 |
+ | 0.5564 | 3.0 | 1677 | 0.5083 |
+ | 0.5207 | 4.0 | 2236 | 0.5057 |
+ | 0.5123 | 5.0 | 2795 | 0.5049 |
 
 
  ### Framework versions
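
The README.md hunks change only the reported losses. As a rough sketch of the arithmetic, the snippet below copies the validation losses from the removed and added table rows and prints the per-epoch difference. The variable and function names are illustrative and not part of the commit.

```python
# Validation loss per epoch, copied from the removed (-) and added (+) table rows.
old_val = {1: 0.5309, 2: 0.5145, 3: 0.5087, 4: 0.5059, 5: 0.5052}
new_val = {1: 0.5304, 2: 0.5142, 3: 0.5083, 4: 0.5057, 5: 0.5049}


def loss_deltas(old, new):
    """Return {epoch: new - old}, rounded to 4 decimals to match the table precision."""
    return {epoch: round(new[epoch] - old[epoch], 4) for epoch in sorted(old)}


deltas = loss_deltas(old_val, new_val)
for epoch, delta in deltas.items():
    print(f"epoch {epoch}: {old_val[epoch]:.4f} -> {new_val[epoch]:.4f} ({delta:+.4f})")
```

Every difference is negative and below 0.001 in magnitude, so the two runs end at nearly the same validation loss.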
adapter_config.json CHANGED
@@ -19,13 +19,13 @@
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
+ "lm_head",
  "k_proj",
  "o_proj",
- "q_proj",
  "up_proj",
- "v_proj",
- "lm_head",
  "down_proj",
+ "v_proj",
+ "q_proj",
  "gate_proj"
  ],
  "task_type": "CAUSAL_LM",
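
The adapter_config.json hunk reorders `target_modules` without adding or removing a name. A minimal check, with both lists copied from the diff, is sketched below. One plausible reason for the reordering, stated here as an assumption rather than something the commit confirms, is that the PEFT library keeps `target_modules` as an unordered set, so the serialized order can vary between saves. The helper name `same_targets` is hypothetical.

```python
# Module lists copied from the old and new sides of the adapter_config.json diff.
old_modules = ["k_proj", "o_proj", "q_proj", "up_proj",
               "v_proj", "lm_head", "down_proj", "gate_proj"]
new_modules = ["lm_head", "k_proj", "o_proj", "up_proj",
               "down_proj", "v_proj", "q_proj", "gate_proj"]


def same_targets(a, b):
    """True when both lists contain the same names, ignoring order."""
    return sorted(a) == sorted(b)


print(same_targets(old_modules, new_modules))  # True
```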
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:6e23de478b88b30d97416b23ab7b3f1210371fd0a7ca3c63be82f82bb87d49d1
+ oid sha256:685cb2b722bc04b4f8ddc062061a6e7f943745a8cc49381d7b521d8eab8a281d
  size 581159464
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:0bfd2215035f93c2441bdf425a24608949c5a667395d9c8b1e60c608c591928d
+ oid sha256:b8bb05001bdd74040f72341ff81896ea930af295a4dee9218828ce6e3d84218d
  size 4920
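
For adapter_model.safetensors and training_args.bin the diff shows only Git LFS pointer files: the `oid` changes while `size` stays the same. The sketch below parses such a pointer and compares a local file against it using only the standard library. The function names are hypothetical, and the local path passed to `matches_pointer` is whatever file the reader has downloaded, not something the commit specifies.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer):
    """Check a local file's byte size and SHA-256 digest against a parsed pointer."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# New pointer for training_args.bin, copied from the diff.
new_training_args = """version https://git-lfs.github.com/spec/v1
oid sha256:b8bb05001bdd74040f72341ff81896ea930af295a4dee9218828ce6e3d84218d
size 4920
"""

pointer = parse_lfs_pointer(new_training_args)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a downloaded copy would return `True` only if both the size and the digest agree with the pointer.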