rafaeloc15 committed on
Commit
842e829
1 Parent(s): 052687a

rafaeloc15/mistral-small-v5

README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0424
+ - Loss: 0.0433
 
  ## Model description
 
@@ -52,8 +52,8 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 0.0614 | 1.0 | 464 | 0.0561 |
- | 0.0464 | 2.0 | 928 | 0.0424 |
+ | 0.0618 | 1.0 | 401 | 0.0579 |
+ | 0.0471 | 2.0 | 802 | 0.0433 |
 
 
  ### Framework versions
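
Reviewer note: the README hunks change the final validation loss from 0.0424 to 0.0433. A minimal sketch for putting those numbers side by side follows. It assumes the reported loss is a mean per-token cross-entropy in nats, which the README does not state; under that assumption `exp(loss)` gives perplexity. The `perplexity` helper is a name introduced here for illustration.

```python
import math


def perplexity(mean_cross_entropy: float) -> float:
    """Convert a mean per-token cross-entropy (assumed to be in nats) to perplexity."""
    return math.exp(mean_cross_entropy)


# Final validation losses taken from the old and new README tables.
old_loss, new_loss = 0.0424, 0.0433

for label, loss in (("old", old_loss), ("new", new_loss)):
    print(f"{label}: loss={loss:.4f} perplexity={perplexity(loss):.4f}")

# Absolute change in validation loss between the two commits.
loss_delta = new_loss - old_loss
print(f"delta={loss_delta:+.4f}")
```

The steps per epoch also differ between the two tables (464 versus 401), so the two runs may not have seen the same number of batches; the diff alone does not say why.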
adapter_config.json CHANGED
@@ -6,6 +6,7 @@
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
+ "layer_replication": null,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {},
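
Reviewer note: the only change in this hunk is the new `layer_replication` key. The sketch below shows one way to confirm that programmatically by comparing the key sets of two config dicts. The two JSON excerpts contain only the keys visible in the hunk, not the full file, and `added_keys` is a hypothetical helper, not part of any library.

```python
import json


def added_keys(old_cfg: dict, new_cfg: dict) -> list[str]:
    """Return the keys present in new_cfg but missing from old_cfg, sorted."""
    return sorted(set(new_cfg) - set(old_cfg))


# Excerpts reconstructed from the hunk above (not the complete adapter_config.json).
old_cfg = json.loads("""
{
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {}
}
""")

new_cfg = json.loads("""
{
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
  "layer_replication": null,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {}
}
""")

print(added_keys(old_cfg, new_cfg))
```

A key that appears with a `null` value and no other config change is often a sign that the file was rewritten by a different version of the tool that produced it, though this commit does not confirm that.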
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:d4d0e0dedc4c7916dad7e5c95d4f77a8593d7765902fc2cff0c4b8cae24366fb
+ oid sha256:7a8d440b290a6a7163205813497d5cf349c36710ffb0eb1ff86df6dd8b239003
  size 109069176
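
Reviewer note: the binary files in this commit are stored as Git LFS pointers, so the diff shows only the three pointer fields (`version`, `oid`, `size`). A minimal sketch of parsing such a pointer and comparing the old and new adapter weights follows. `LfsPointer` and `parse_lfs_pointer` are names introduced here for illustration, and the parser handles only the three fields seen in this diff.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LfsPointer:
    version: str
    hash_algo: str
    digest: str
    size: int


def parse_lfs_pointer(text: str) -> LfsPointer:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return LfsPointer(fields["version"], algo, digest, int(fields["size"]))


old_ptr = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:d4d0e0dedc4c7916dad7e5c95d4f77a8593d7765902fc2cff0c4b8cae24366fb
size 109069176
""")

new_ptr = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:7a8d440b290a6a7163205813497d5cf349c36710ffb0eb1ff86df6dd8b239003
size 109069176
""")

# Same byte size, different digest: the contents changed but the file length did not.
print(old_ptr.size == new_ptr.size, old_ptr.digest != new_ptr.digest)
```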
runs/Apr15_11-56-17_a3c9d11ff418/events.out.tfevents.1713182276.a3c9d11ff418.668.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:7f40b3b610ff777ccc49351d340267d2ae1f10cf86c7597379b0f5d695527021
+ size 10034
runs/Apr15_11-56-17_a3c9d11ff418/events.out.tfevents.1713182422.a3c9d11ff418.668.1 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:abef5c83d593056e100e2e09a47a49870d856acff4dc2b4e0f35cd77273fb212
+ size 22789
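
Reviewer note: the two added files under `runs/` are TensorBoard event logs. The sketch below splits such a filename into its parts, assuming the common `events.out.tfevents.<unix time>.<hostname>.<pid>.<index>` layout; that layout is an assumption about the writer, not something this commit documents. `parse_event_filename` and `EventFileName` are hypothetical names.

```python
import re
from dataclasses import dataclass
from datetime import datetime, timezone

# Assumed layout: events.out.tfevents.<unix time>.<hostname>.<pid>.<index>
_PATTERN = re.compile(r"^events\.out\.tfevents\.(\d+)\.(.+)\.(\d+)\.(\d+)$")


@dataclass(frozen=True)
class EventFileName:
    created: datetime
    hostname: str
    pid: int
    index: int


def parse_event_filename(name: str) -> EventFileName:
    """Split a TensorBoard event filename into timestamp, host, pid and index."""
    match = _PATTERN.match(name)
    if match is None:
        raise ValueError(f"unrecognized event filename: {name!r}")
    ts, host, pid, index = match.groups()
    created = datetime.fromtimestamp(int(ts), tz=timezone.utc)
    return EventFileName(created, host, int(pid), int(index))


first = parse_event_filename("events.out.tfevents.1713182276.a3c9d11ff418.668.0")
second = parse_event_filename("events.out.tfevents.1713182422.a3c9d11ff418.668.1")

print(first.created.isoformat(), first.hostname, first.pid, first.index)
print((second.created - first.created).total_seconds())
```

Under that assumption both files come from the same host and process, written 146 seconds apart, which is consistent with the `Apr15_11-56-17` run directory name.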
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8f830bdfd4b7fde5b8e907c86943d9c700ebbc3e58ddad3a0d333f9667dabcc6
+ oid sha256:1a4a5e19f0f2e97a5899521a1e3351525fa6d1bd06675ca697824cbdf5b0eedf
  size 4920