erythropygia
/

Qwen2.5-7B-Instruct-OpenR1-Turkish

Text Generation

text-generation-inference

Model card Files Files and versions

erythropygia commited on Mar 5

Commit

e72a922

·

verified ·

1 Parent(s): 932e912

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -17,8 +17,8 @@ base_model:
 - **Base Model**: Qwen2.5-7B-Instructions
 - **Fine-tuned with GRPO and LoRa (Low-Rank Adaptation)**
 - **Context Window**: 4096 tokens
-- **The first model was fine-tuned on an L4 GPU for 50 hours and achieved a score of 51/100 on the gsm8k-tr dataset.
-    After additional fine-tuning on an A1000 GPU for 10 more hours, its score improved to 57/100 on the same dataset.**
 ## Usage

 - **Base Model**: Qwen2.5-7B-Instructions
 - **Fine-tuned with GRPO and LoRa (Low-Rank Adaptation)**
 - **Context Window**: 4096 tokens
+- **The first model was fine-tuned on an L4 GPU for 50 hours and achieved a score of 51/100(answer) on the gsm8k-tr dataset.
+    After additional fine-tuning on an A1000 GPU for 10 more hours, its score improved to 57/100(answer) on the same dataset.**
 ## Usage