End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -103,7 +103,7 @@ xformers_attention: true
 This model is a fine-tuned version of [unsloth/Qwen2.5-3B-Instruct](https://huggingface.co/unsloth/Qwen2.5-3B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6561
 ## Model description
@@ -141,8 +141,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 2.8513        | 0.0165 | 1    | 3.8587          |
-| 0.5981        | 0.4115 | 25   | 0.7051          |
-| 0.5867        | 0.8230 | 50   | 0.6561          |
 ### Framework versions

 This model is a fine-tuned version of [unsloth/Qwen2.5-3B-Instruct](https://huggingface.co/unsloth/Qwen2.5-3B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6542
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 2.8513        | 0.0165 | 1    | 3.8587          |
+| 0.5986        | 0.4115 | 25   | 0.7033          |
+| 0.5858        | 0.8230 | 50   | 0.6542          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "down_proj",
-    "v_proj",
     "gate_proj",
-    "up_proj",
-    "q_proj",
-    "k_proj",
-    "o_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "k_proj",
+    "q_proj",
+    "o_proj",
+    "up_proj",
     "down_proj",
     "gate_proj",
+    "v_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:02ec501265a7cdfa2c821893ee55d3b98ce44e34622cf016235dbba41dbdcb7b
 size 239650666

 version https://git-lfs.github.com/spec/v1
+oid sha256:92a2634694ed057a4db5fb83510c9f0228cffc092a1f1f5f1b148964e6ca136a
 size 239650666

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e17665ce5a09cfa29e3d3dcb84f65d1d7dacfd5b1884c75a226827ecf58dc0b8
 size 239536272

 version https://git-lfs.github.com/spec/v1
+oid sha256:cb9c433e67d7d67735ab4c08b384b700ad7bbb2ef098bc9db8b5f55f05f73f4a
 size 239536272

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:85e8dfa1540ca3fe1a9279429500b8935e685568e77962b57855c979b3e8ce15
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:94f8966a7f5610ce3154bf3af6bc87a6aa3dca284fbee83e288364d669bb4725
 size 6776