Commit c4110be (verified), committed by Pinkstack
1 Parent(s): 7a27fba

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ pipeline_tag: text-generation
 # Information
 Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
 
-Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
+Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning & modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
 
 # Format
 ```