rafaeloc15 commited on
Commit
e0172f3
1 Parent(s): 1e4e0c5

rafaeloc15/mistral-small-v5

Browse files
README.md CHANGED
@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.0433
24
 
25
  ## Model description
26
 
@@ -52,8 +52,8 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:-----:|:----:|:---------------:|
55
- | 0.0618 | 1.0 | 401 | 0.0579 |
56
- | 0.0471 | 2.0 | 802 | 0.0433 |
57
 
58
 
59
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [mistralai/Mistral-7B-Instruct-v0.2](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) on the generator dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.0738
24
 
25
  ## Model description
26
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:-----:|:----:|:---------------:|
55
+ | 0.1033 | 1.0 | 82 | 0.0988 |
56
+ | 0.0809 | 2.0 | 164 | 0.0738 |
57
 
58
 
59
  ### Framework versions
adapter_config.json CHANGED
@@ -6,6 +6,7 @@
6
  "fan_in_fan_out": false,
7
  "inference_mode": true,
8
  "init_lora_weights": true,
 
9
  "layers_pattern": null,
10
  "layers_to_transform": null,
11
  "loftq_config": {},
 
6
  "fan_in_fan_out": false,
7
  "inference_mode": true,
8
  "init_lora_weights": true,
9
+ "layer_replication": null,
10
  "layers_pattern": null,
11
  "layers_to_transform": null,
12
  "loftq_config": {},
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:7a8d440b290a6a7163205813497d5cf349c36710ffb0eb1ff86df6dd8b239003
3
  size 109069176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:030fc4a3b11ca1f9daaf45430f913ce20e97bc6c9590d374f7f7a0a526cf0469
3
  size 109069176
runs/Apr18_11-19-03_eb7d68fc1d12/events.out.tfevents.1713439235.eb7d68fc1d12.465.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:805b7ee66d8c2c9e4003c3da2920a2f498cf1adce3e2998b49bf9914a598fbbc
3
+ size 9280
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1a4a5e19f0f2e97a5899521a1e3351525fa6d1bd06675ca697824cbdf5b0eedf
3
  size 4920
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:da7e101c0c35f47975325d91caa6d28dc01850bfc3bf2bd7059b7710c2810cb3
3
  size 4920