cy948
/

Qwen2.5-Coder-1.5B-Instruct-Airscript

Generated from Trainer

Model card Files Files and versions Community

cy948 commited on Oct 27, 2024

Commit

7305ace

•

1 Parent(s): b85d156

End of training

Files changed (1) hide show

README.md +16 -16

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Qwen/Qwen2.5-Coder-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2188
 ## Model description
@@ -50,21 +50,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.9398        | 0.0625 | 100  | 1.9159          |
-| 1.6308        | 0.1251 | 200  | 1.6287          |
-| 1.4895        | 0.1876 | 300  | 1.4939          |
-| 1.3812        | 0.2502 | 400  | 1.4127          |
-| 1.316         | 0.3127 | 500  | 1.3550          |
-| 1.2703        | 0.3752 | 600  | 1.3150          |
-| 1.2175        | 0.4378 | 700  | 1.2849          |
-| 1.1885        | 0.5003 | 800  | 1.2648          |
-| 1.1624        | 0.5629 | 900  | 1.2497          |
-| 1.143         | 0.6254 | 1000 | 1.2400          |
-| 1.1334        | 0.6879 | 1100 | 1.2319          |
-| 1.1118        | 0.7505 | 1200 | 1.2259          |
-| 1.1102        | 0.8130 | 1300 | 1.2215          |
-| 1.1017        | 0.8755 | 1400 | 1.2197          |
-| 1.1066        | 0.9381 | 1500 | 1.2188          |
 ### Framework versions

 This model is a fine-tuned version of [Qwen/Qwen2.5-Coder-1.5B-Instruct](https://huggingface.co/Qwen/Qwen2.5-Coder-1.5B-Instruct) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4578
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 2.1783        | 0.0625 | 100  | 2.1005          |
+| 1.8424        | 0.1251 | 200  | 1.8531          |
+| 1.7342        | 0.1876 | 300  | 1.7347          |
+| 1.6314        | 0.2502 | 400  | 1.6523          |
+| 1.5815        | 0.3127 | 500  | 1.5977          |
+| 1.5495        | 0.3752 | 600  | 1.5601          |
+| 1.5015        | 0.4378 | 700  | 1.5319          |
+| 1.4848        | 0.5003 | 800  | 1.5099          |
+| 1.4606        | 0.5629 | 900  | 1.4929          |
+| 1.4478        | 0.6254 | 1000 | 1.4813          |
+| 1.4428        | 0.6879 | 1100 | 1.4717          |
+| 1.4248        | 0.7505 | 1200 | 1.4657          |
+| 1.4218        | 0.8130 | 1300 | 1.4614          |
+| 1.418         | 0.8755 | 1400 | 1.4588          |
+| 1.4238        | 0.9381 | 1500 | 1.4578          |
 ### Framework versions