rafaeloc15 committed on
Commit f72d007 (1 parent: bada4e6)

rafaeloc15/mistral-small-v5

README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0370
+ - Loss: 0.0424
 
  ## Model description
 
@@ -52,8 +52,8 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.0516 | 1.0 | 268 | 0.0495 |
- | 0.0393 | 2.0 | 536 | 0.0370 |
+ | 0.0614 | 1.0 | 464 | 0.0561 |
+ | 0.0464 | 2.0 | 928 | 0.0424 |
 
 
  ### Framework versions
adapter_config.json CHANGED
@@ -6,6 +6,7 @@
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
+ "layer_replication": null,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {},
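The only change to adapter_config.json in this hunk is one new key. A minimal sketch of how such a key-level difference could be detected between two versions of the file is given below. The helper name `added_keys` is hypothetical, and the two JSON strings contain only the keys visible in the hunk, not the full config. Whether a given loader tolerates the extra key is not stated in this commit.

```python
import json


def added_keys(old_json: str, new_json: str) -> list[str]:
    """Return the top-level keys present in new_json but not in old_json."""
    old = json.loads(old_json)
    new = json.loads(new_json)
    return sorted(set(new) - set(old))


# Only the keys shown in the hunk above; the real file has more entries.
OLD_FRAGMENT = """{
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {}
}"""

NEW_FRAGMENT = """{
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
  "layer_replication": null,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {}
}"""

if __name__ == "__main__":
    print(added_keys(OLD_FRAGMENT, NEW_FRAGMENT))  # ['layer_replication']
```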
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d6c0cde07c559318e0dc4af4c3a278cc3547384e114ac2aa9aa78216f957ed1d
+ oid sha256:d4d0e0dedc4c7916dad7e5c95d4f77a8593d7765902fc2cff0c4b8cae24366fb
  size 109069176
runs/Apr12_15-13-53_e6c5c418f5fd/events.out.tfevents.1712935104.e6c5c418f5fd.162.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:98cc06420640832490b123ce6e1dbbece5661dfff0cb99ad568b1a9d005f352a
+ size 5061
runs/Apr12_15-37-39_e6c5c418f5fd/events.out.tfevents.1712936376.e6c5c418f5fd.162.5 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d56e090edf12e65ec580f72097e019014d43123d6acaae43187a2d722d27cc55
+ size 25321
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:56b2d091aa5702b516be959e5dd70db67e57a0104418c5551125ccf39e7f7272
+ oid sha256:8f830bdfd4b7fde5b8e907c86943d9c700ebbc3e58ddad3a0d333f9667dabcc6
  size 4920
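The binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object ID, and the size in bytes. A minimal sketch of parsing such a pointer and checking a downloaded file against it could look like the following. The names `LfsPointer`, `parse_lfs_pointer`, and `matches_pointer` are hypothetical helpers for illustration, not part of any Git LFS tooling.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class LfsPointer:
    version: str   # spec URL from the "version" line
    oid_algo: str  # hash algorithm named before the colon, e.g. "sha256"
    oid: str       # hex digest after the colon
    size: int      # size of the real object in bytes


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return LfsPointer(fields["version"], algo, digest, int(fields["size"]))


def matches_pointer(pointer: LfsPointer, data: bytes) -> bool:
    """Check that data has the size and SHA-256 digest the pointer records."""
    if pointer.oid_algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer.oid_algo}")
    return (
        len(data) == pointer.size
        and hashlib.sha256(data).hexdigest() == pointer.oid
    )


if __name__ == "__main__":
    # Pointer text copied from the first added tfevents file in this commit.
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:98cc06420640832490b123ce6e1dbbece5661dfff0cb99ad568b1a9d005f352a\n"
        "size 5061\n"
    )
    print(pointer.oid_algo, pointer.size)  # sha256 5061
```

In this commit the pointer size for adapter_model.safetensors is unchanged (109069176 bytes) while the oid differs, so a size check alone would not distinguish the old weights from the new ones; the digest comparison does.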