b09501048 commited on
Commit
5c4f584
1 Parent(s): 1eb2e50

End of training

Browse files
Files changed (1) hide show
  1. README.md +13 -0
README.md CHANGED
@@ -18,6 +18,8 @@ should probably proofread and complete it, then remove this comment. -->
18
  # ADL_Gemma4
19
 
20
  This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on an unknown dataset.
 
 
21
 
22
  ## Model description
23
 
@@ -45,6 +47,17 @@ The following hyperparameters were used during training:
45
  - lr_scheduler_warmup_ratio: 0.02
46
  - num_epochs: 5
47
 
 
 
 
 
 
 
 
 
 
 
 
48
  ### Framework versions
49
 
50
  - PEFT 0.13.2
 
18
  # ADL_Gemma4
19
 
20
  This model is a fine-tuned version of [zake7749/gemma-2-2b-it-chinese-kyara-dpo](https://huggingface.co/zake7749/gemma-2-2b-it-chinese-kyara-dpo) on an unknown dataset.
21
+ It achieves the following results on the evaluation set:
22
+ - Loss: 1.3535
23
 
24
  ## Model description
25
 
 
47
  - lr_scheduler_warmup_ratio: 0.02
48
  - num_epochs: 5
49
 
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss |
53
+ |:-------------:|:-----:|:----:|:---------------:|
54
+ | 1.5844 | 1.0 | 1250 | 1.4583 |
55
+ | 1.4602 | 2.0 | 2500 | 1.3865 |
56
+ | 1.4141 | 3.0 | 3750 | 1.3632 |
57
+ | 1.4011 | 4.0 | 5000 | 1.3547 |
58
+ | 1.3975 | 5.0 | 6250 | 1.3535 |
59
+
60
+
61
  ### Framework versions
62
 
63
  - PEFT 0.13.2