jazzson commited on
Commit
13b8f72
1 Parent(s): 7551a2f

End of training

Browse files
Files changed (1) hide show
  1. README.md +12 -12
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 2.4331
20
 
21
  ## Model description
22
 
@@ -49,17 +49,17 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
- | 2.2169 | 0.3556 | 200 | 2.0577 |
53
- | 2.0184 | 0.7111 | 400 | 1.9730 |
54
- | 1.8854 | 1.0667 | 600 | 1.9618 |
55
- | 1.6054 | 1.4222 | 800 | 1.9616 |
56
- | 1.6254 | 1.7778 | 1000 | 1.9407 |
57
- | 1.4225 | 2.1333 | 1200 | 2.1041 |
58
- | 1.1622 | 2.4889 | 1400 | 2.1172 |
59
- | 1.1694 | 2.8444 | 1600 | 2.1113 |
60
- | 0.9801 | 3.2 | 1800 | 2.4416 |
61
- | 0.8002 | 3.5556 | 2000 | 2.4504 |
62
- | 0.7998 | 3.9111 | 2200 | 2.4331 |
63
 
64
 
65
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 2.4426
20
 
21
  ## Model description
22
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:------:|:----:|:---------------:|
52
+ | 2.3055 | 0.3556 | 200 | 2.1449 |
53
+ | 2.107 | 0.7111 | 400 | 2.0629 |
54
+ | 1.9753 | 1.0667 | 600 | 2.0500 |
55
+ | 1.6997 | 1.4222 | 800 | 2.0477 |
56
+ | 1.7306 | 1.7778 | 1000 | 2.0290 |
57
+ | 1.5365 | 2.1333 | 1200 | 2.1725 |
58
+ | 1.2898 | 2.4889 | 1400 | 2.1792 |
59
+ | 1.3049 | 2.8444 | 1600 | 2.1651 |
60
+ | 1.1339 | 3.2 | 1800 | 2.4387 |
61
+ | 0.9677 | 3.5556 | 2000 | 2.4493 |
62
+ | 0.9654 | 3.9111 | 2200 | 2.4426 |
63
 
64
 
65
  ### Framework versions