Update README.md
Browse files
README.md
CHANGED
|
@@ -17,8 +17,8 @@ base_model:
|
|
| 17 |
- **Base Model**: Qwen2.5-7B-Instructions
|
| 18 |
- **Fine-tuned with GRPO and LoRa (Low-Rank Adaptation)**
|
| 19 |
- **Context Window**: 4096 tokens
|
| 20 |
-
- **The first model was fine-tuned on an L4 GPU for 50 hours and achieved a score of 51/100 on the gsm8k-tr dataset.
|
| 21 |
-
After additional fine-tuning on an A1000 GPU for 10 more hours, its score improved to 57/100 on the same dataset.**
|
| 22 |
|
| 23 |
## Usage
|
| 24 |
|
|
|
|
| 17 |
- **Base Model**: Qwen2.5-7B-Instructions
|
| 18 |
- **Fine-tuned with GRPO and LoRa (Low-Rank Adaptation)**
|
| 19 |
- **Context Window**: 4096 tokens
|
| 20 |
+
- **The first model was fine-tuned on an L4 GPU for 50 hours and achieved a score of 51/100(answer) on the gsm8k-tr dataset.
|
| 21 |
+
After additional fine-tuning on an A1000 GPU for 10 more hours, its score improved to 57/100(answer) on the same dataset.**
|
| 22 |
|
| 23 |
## Usage
|
| 24 |
|