This model is a fine-tuned version of deepseek-ai/DeepSeek-R1-Distill-Qwen-7B on the sc_preference_v2 dataset.
More information needed
The following hyperparameters were used during training:
Base model: deepseek-ai/DeepSeek-R1-Distill-Qwen-7B