fionazhang commited on
Commit
df18562
1 Parent(s): 6b4b411

fionazhang/mistral-experiment-6

Browse files
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 2.0203
22
 
23
  ## Model description
24
 
@@ -44,7 +44,7 @@ The following hyperparameters were used during training:
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: constant
46
  - lr_scheduler_warmup_ratio: 0.03
47
- - num_epochs: 1
48
 
49
  ### Training results
50
 
 
18
 
19
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 2.1400
22
 
23
  ## Model description
24
 
 
44
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
45
  - lr_scheduler_type: constant
46
  - lr_scheduler_warmup_ratio: 0.03
47
+ - num_epochs: 2
48
 
49
  ### Training results
50
 
adapter_config.json CHANGED
@@ -20,10 +20,10 @@
20
  "revision": null,
21
  "target_modules": [
22
  "o_proj",
23
- "k_proj",
24
  "gate_proj",
25
- "v_proj",
26
- "q_proj"
 
27
  ],
28
  "task_type": "CAUSAL_LM"
29
  }
 
20
  "revision": null,
21
  "target_modules": [
22
  "o_proj",
 
23
  "gate_proj",
24
+ "q_proj",
25
+ "k_proj",
26
+ "v_proj"
27
  ],
28
  "task_type": "CAUSAL_LM"
29
  }
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:8a35597ba87f48f26d72744e18209d52551a85b7132f4df95dabe30094e461c4
3
  size 23111352
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4084bac1af648b499f5aff5cfaa0442b653b5773c9715720e487156157d98843
3
  size 23111352
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:277e5baab97c340ab6fd277f05670c021e8a94d43fa7c5ea8c2537114bc3eaa3
3
  size 4664
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d34b253ed769e6b3ffd1a0af7920ff9dc1af6bb97a2425819231127b0e983954
3
  size 4664