andrewAmani commited on
Commit
af833af
1 Parent(s): 020dedf

Model save

Browse files
Files changed (1) hide show
  1. README.md +8 -8
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [hivaze/ParaLex-Llama-3-8B-SFT](https://huggingface.co/hivaze/ParaLex-Llama-3-8B-SFT) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.3215
19
 
20
  ## Model description
21
 
@@ -34,7 +34,7 @@ More information needed
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
- - learning_rate: 0.0003
38
  - train_batch_size: 1
39
  - eval_batch_size: 8
40
  - seed: 42
@@ -48,12 +48,12 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | 0.4022 | 1.25 | 5 | 0.3324 |
52
- | 0.3492 | 2.5 | 10 | 0.3161 |
53
- | 0.3181 | 3.75 | 15 | 0.3138 |
54
- | 0.2808 | 5.0 | 20 | 0.3177 |
55
- | 0.2571 | 6.25 | 25 | 0.3206 |
56
- | 0.2424 | 7.5 | 30 | 0.3215 |
57
 
58
 
59
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [hivaze/ParaLex-Llama-3-8B-SFT](https://huggingface.co/hivaze/ParaLex-Llama-3-8B-SFT) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.8083
19
 
20
  ## Model description
21
 
 
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
+ - learning_rate: 2e-05
38
  - train_batch_size: 1
39
  - eval_batch_size: 8
40
  - seed: 42
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | 7.3306 | 1.25 | 5 | 5.9428 |
52
+ | 5.4669 | 2.5 | 10 | 4.3334 |
53
+ | 4.0282 | 3.75 | 15 | 3.1156 |
54
+ | 2.9271 | 5.0 | 20 | 2.3114 |
55
+ | 2.3074 | 6.25 | 25 | 1.9202 |
56
+ | 1.9795 | 7.5 | 30 | 1.8083 |
57
 
58
 
59
  ### Framework versions