Pinkstack
/

Superthoughts-lite-v1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Pinkstack commited on 16 days ago

Commit

c4110be

·

verified ·

1 Parent(s): 7a27fba

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -23,7 +23,7 @@ pipeline_tag: text-generation
 # Information
 Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
-Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
 # Format
 ```

 # Information
 Advanced, high-quality and lite reasoning for a tiny size that you can run on your phone.
+Trained similarly to Deepseek R1, we used Smollm2 as a base model, then we've SFT fine tuned in on reasoning & modified the tokenizer slightly, after the SFT fine tuning we used GRPO to further amplify it's mathematics & problem solving abilities.
 # Format
 ```