vxbrandon commited on
Commit
0fad552
1 Parent(s): 3ce8ee5

End of training

Browse files
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 2.1311
19
 
20
  ## Model description
21
 
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
45
  - total_eval_batch_size: 4
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
- - training_steps: 3250
49
 
50
  ### Training results
51
 
@@ -181,6 +181,26 @@ The following hyperparameters were used during training:
181
  | 2.2584 | 0.51 | 3200 | 2.4062 |
182
  | 2.1848 | 0.52 | 3225 | 2.4075 |
183
  | 2.1779 | 0.52 | 3250 | 2.4066 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
184
 
185
 
186
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 2.1297
19
 
20
  ## Model description
21
 
 
45
  - total_eval_batch_size: 4
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
+ - training_steps: 3750
49
 
50
  ### Training results
51
 
 
181
  | 2.2584 | 0.51 | 3200 | 2.4062 |
182
  | 2.1848 | 0.52 | 3225 | 2.4075 |
183
  | 2.1779 | 0.52 | 3250 | 2.4066 |
184
+ | 2.2542 | 0.52 | 3275 | 2.4041 |
185
+ | 2.2406 | 0.53 | 3300 | 2.4066 |
186
+ | 2.1247 | 0.53 | 3325 | 2.4023 |
187
+ | 2.2576 | 0.54 | 3350 | 2.4041 |
188
+ | 2.1636 | 0.54 | 3375 | 2.4023 |
189
+ | 2.1781 | 0.54 | 3400 | 2.4056 |
190
+ | 2.1949 | 0.55 | 3425 | 2.4047 |
191
+ | 2.1119 | 0.55 | 3450 | 2.4070 |
192
+ | 2.2437 | 0.56 | 3475 | 2.4096 |
193
+ | 2.281 | 0.56 | 3500 | 2.4040 |
194
+ | 2.2499 | 0.56 | 3525 | 2.4063 |
195
+ | 2.2129 | 0.57 | 3550 | 2.4052 |
196
+ | 2.2115 | 0.57 | 3575 | 2.4050 |
197
+ | 2.375 | 0.58 | 3600 | 2.4050 |
198
+ | 2.1891 | 0.58 | 3625 | 2.4082 |
199
+ | 2.3929 | 0.58 | 3650 | 2.4038 |
200
+ | 2.1928 | 0.59 | 3675 | 2.4079 |
201
+ | 2.3194 | 0.59 | 3700 | 2.4067 |
202
+ | 2.2286 | 0.6 | 3725 | 2.4086 |
203
+ | 2.1629 | 0.6 | 3750 | 2.4058 |
204
 
205
 
206
  ### Framework versions
model-00001-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5b233940374f25e0fbf25b4e4ea26d848bad5e5c018dab5dfc8cc6d0d5721dd0
3
  size 4943163992
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fbae751d3da17f6a1ab2c42e3bcdd1f7dac483d72696b15ff0c5af6a8750a0ee
3
  size 4943163992
model-00002-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:67192b8d3bd11e37f666ed2643b0c03c330e29ff089245d46ade68210d4665a3
3
  size 4999821144
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5e78fb4db971b80258a944ccdb6b10ca4910a0397f186297e0533c9fa486a2c7
3
  size 4999821144
model-00003-of-00003.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1b8772eba1cbe9c38550239659dd80185a7b27a76439c5e72edd4adedd1c817a
3
  size 4540517840
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9232e1fb567fbdfcb8ca17ab94b9c56195246778aa54bbb0b60d9c226cd8fa7e
3
  size 4540517840