cy948/Qwen2.5-Coder-1.5B-Instruct-Airscript

Generated from Trainer


cy948 committed on Oct 28

Commit 5a36957 • 1 parent: dcc8887

End of training

Files changed (1)

README.md +17 -16

@@ -44,27 +44,28 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 30
-- training_steps: 1599
+- training_steps: 1600
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 2.1783        | 0.0625 | 100  | 2.1005          |
-| 1.8424        | 0.1251 | 200  | 1.8531          |
-| 1.7342        | 0.1876 | 300  | 1.7347          |
-| 1.6314        | 0.2502 | 400  | 1.6523          |
-| 1.5815        | 0.3127 | 500  | 1.5977          |
-| 1.5495        | 0.3752 | 600  | 1.5601          |
-| 1.5015        | 0.4378 | 700  | 1.5319          |
-| 1.4848        | 0.5003 | 800  | 1.5099          |
-| 1.4606        | 0.5629 | 900  | 1.4929          |
-| 1.4478        | 0.6254 | 1000 | 1.4813          |
-| 1.4428        | 0.6879 | 1100 | 1.4717          |
-| 1.4248        | 0.7505 | 1200 | 1.4657          |
-| 1.4218        | 0.8130 | 1300 | 1.4614          |
-| 1.418         | 0.8755 | 1400 | 1.4588          |
-| 1.4238        | 0.9381 | 1500 | 1.4578          |
+| 2.1783        | 0.0625 | 100  | 2.1006          |
+| 1.8426        | 0.125  | 200  | 1.8535          |
+| 1.7343        | 0.1875 | 300  | 1.7350          |
+| 1.6313        | 0.25   | 400  | 1.6520          |
+| 1.5817        | 0.3125 | 500  | 1.5982          |
+| 1.5498        | 0.375  | 600  | 1.5604          |
+| 1.5019        | 0.4375 | 700  | 1.5322          |
+| 1.4852        | 0.5    | 800  | 1.5103          |
+| 1.461         | 0.5625 | 900  | 1.4939          |
+| 1.4483        | 0.625  | 1000 | 1.4820          |
+| 1.4434        | 0.6875 | 1100 | 1.4723          |
+| 1.4254        | 0.75   | 1200 | 1.4659          |
+| 1.4224        | 0.8125 | 1300 | 1.4619          |
+| 1.4188        | 0.875  | 1400 | 1.4596          |
+| 1.4245        | 0.9375 | 1500 | 1.4585          |
+| 1.4172        | 1.0    | 1600 | 1.4578          |
 ### Framework versions
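
The Epoch column in both versions of the table is consistent with dividing the step count by the `training_steps` value and rounding to four decimal places. The sketch below reproduces that arithmetic. It assumes that one epoch equals `training_steps` optimizer steps in each run, which the diff suggests but does not state; `epoch_at` and the two constants are hypothetical names introduced only for illustration.

```python
def epoch_at(step: int, steps_per_epoch: int) -> float:
    """Fraction of an epoch completed after `step` steps, rounded to 4 places."""
    return round(step / steps_per_epoch, 4)


# Assumption: one epoch spans exactly `training_steps` steps in each run.
OLD_STEPS_PER_EPOCH = 1599  # training_steps before this commit
NEW_STEPS_PER_EPOCH = 1600  # training_steps after this commit

# Print the old and new epoch fraction for a few of the logged steps.
for step in (100, 200, 800, 1500, 1600):
    old_epoch = epoch_at(step, OLD_STEPS_PER_EPOCH)
    new_epoch = epoch_at(step, NEW_STEPS_PER_EPOCH)
    print(f"step {step:>4}: old epoch {old_epoch}, new epoch {new_epoch}")
```

Under this assumption, a 1599-step run stops one step short of the step-1600 evaluation, which would explain why the final row (epoch 1.0) appears only in the new table.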