Update README.md
Browse files
README.md
CHANGED
|
@@ -11,6 +11,8 @@ base_model:
|
|
| 11 |
|
| 12 |
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over 2 training epochs.
|
| 13 |
|
|
|
|
|
|
|
| 14 |
## Training Details
|
| 15 |
|
| 16 |
- **Base Model**: Qwen/Qwen2.5-Coder-14B-Instruct
|
|
|
|
| 11 |
|
| 12 |
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over 2 training epochs.
|
| 13 |
|
| 14 |
+
After testing more I found that the model does not always include reasoning, I will update with 3 epochs.
|
| 15 |
+
|
| 16 |
## Training Details
|
| 17 |
|
| 18 |
- **Base Model**: Qwen/Qwen2.5-Coder-14B-Instruct
|