ytcheng commited on
Commit
7b1f212
1 Parent(s): 0c21bb8

Model save

Browse files
Files changed (1) hide show
  1. README.md +11 -11
README.md CHANGED
@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [ytcheng/llama-3-8b-hf-sm-lora-merged](https://huggingface.co/ytcheng/llama-3-8b-hf-sm-lora-merged) on the generator dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 3.0057
23
 
24
  ## Model description
25
 
@@ -52,16 +52,16 @@ The following hyperparameters were used during training:
52
 
53
  ### Training results
54
 
55
- | Training Loss | Epoch | Step | Validation Loss |
56
- |:-------------:|:-----:|:----:|:---------------:|
57
- | 3.6712 | 1.0 | 33 | 3.2633 |
58
- | 2.6495 | 2.0 | 66 | 2.5112 |
59
- | 2.3212 | 3.0 | 99 | 2.3937 |
60
- | 2.0921 | 4.0 | 132 | 2.4587 |
61
- | 1.9862 | 5.0 | 165 | 2.8611 |
62
- | 1.9494 | 6.0 | 198 | 2.8478 |
63
- | 1.9216 | 7.0 | 231 | 3.0062 |
64
- | 1.9042 | 8.0 | 264 | 3.0057 |
65
 
66
 
67
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [ytcheng/llama-3-8b-hf-sm-lora-merged](https://huggingface.co/ytcheng/llama-3-8b-hf-sm-lora-merged) on the generator dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 3.0952
23
 
24
  ## Model description
25
 
 
52
 
53
  ### Training results
54
 
55
+ | Training Loss | Epoch | Step | Validation Loss |
56
+ |:-------------:|:------:|:----:|:---------------:|
57
+ | 3.6219 | 0.9851 | 33 | 3.2342 |
58
+ | 2.6017 | 2.0 | 67 | 2.4826 |
59
+ | 2.2366 | 2.9851 | 100 | 2.3797 |
60
+ | 2.0617 | 4.0 | 134 | 2.6861 |
61
+ | 1.9633 | 4.9851 | 167 | 3.0894 |
62
+ | 1.8968 | 6.0 | 201 | 3.1514 |
63
+ | 1.8985 | 6.9851 | 234 | 3.1113 |
64
+ | 1.8886 | 7.8806 | 264 | 3.0952 |
65
 
66
 
67
  ### Framework versions