Small Model Learnability Gap: Models
Collection
24 items
•
Updated
•
1
This model is a fine-tuned version of Qwen/Qwen2.5-7B-Instruct on the MATH_training_Qwen_QwQ_32B_Preview dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
0.278 | 0.2999 | 200 | 0.3187 |
0.2849 | 0.5997 | 400 | 0.3021 |
0.3367 | 0.8996 | 600 | 0.2921 |
0.1591 | 1.1994 | 800 | 0.3060 |
0.1367 | 1.4993 | 1000 | 0.3032 |
0.0979 | 1.7991 | 1200 | 0.3018 |