Ejafa commited on
Commit
d582bc6
1 Parent(s): 59a0af6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -0
README.md CHANGED
@@ -18,6 +18,12 @@ model-index:
18
 
19
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
20
  should probably proofread and complete it, then remove this comment. -->
 
 
 
 
 
 
21
 
22
  # phi-3-mini-128k-instruct-simpo-lr-5e-07-gamma-1.5
23
 
 
18
 
19
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
20
  should probably proofread and complete it, then remove this comment. -->
21
+ ## Description
22
+ This model was trained as part of the Reinforcement Learning - 24 project at Peking University, focusing on [simpo].
23
+
24
+ ## Authors
25
+ - Ejafa Bassam
26
+ - Yaroslav Ponomarenko
27
 
28
  # phi-3-mini-128k-instruct-simpo-lr-5e-07-gamma-1.5
29