mistralai/mistral-instruct-generation_tr

Files changed (5)

README.md CHANGED

@@ -20,7 +20,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [malhajar/Mistral-7B-Instruct-v0.2-turkish](https://huggingface.co/malhajar/Mistral-7B-Instruct-v0.2-turkish) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0052
+- Loss: 0.0125
 ## Model description
@@ -53,11 +53,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.2996        | 0.6061 | 20   | 0.0856          |
-| 0.0214        | 1.2121 | 40   | 0.0149          |
-| 0.0106        | 1.8182 | 60   | 0.0084          |
-| 0.0077        | 2.4242 | 80   | 0.0064          |
-| 0.0069        | 3.0303 | 100  | 0.0052          |
+| 0.3706        | 0.1198 | 20   | 0.1194          |
+| 0.0327        | 0.2395 | 40   | 0.0267          |
+| 0.0231        | 0.3593 | 60   | 0.0182          |
+| 0.0167        | 0.4790 | 80   | 0.0147          |
+| 0.0155        | 0.5988 | 100  | 0.0125          |
 ### Framework versions
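
The Epoch and Step columns in the two training-log tables above imply how many optimizer steps make up one epoch in each run. A minimal sketch of that arithmetic follows; the helper name `steps_per_epoch` is hypothetical, and the result is an inference from the logged values, not a figure stated in the commit.

```python
def steps_per_epoch(step: int, epoch: float) -> int:
    """Estimate optimizer steps per epoch from one logged (step, epoch) pair."""
    return round(step / epoch)


# Last logged row of each run, copied from the README tables.
old_run = steps_per_epoch(100, 3.0303)  # removed rows
new_run = steps_per_epoch(100, 0.5988)  # added rows

print(old_run, new_run)                  # 33 167
print(f"ratio: {new_run / old_run:.2f}")  # ratio: 5.06
```

If batch size and gradient accumulation were the same in both runs (an assumption; those settings are not part of this diff), the new run saw roughly five times as many training examples per epoch, and its 100 logged steps cover only about 0.6 of an epoch. That makes the two final validation losses (0.0052 and 0.0125) hard to compare directly.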

adapter_config.json CHANGED

@@ -20,8 +20,8 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "v_proj"
+    "v_proj",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
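
The only change in this hunk is the order of the two `target_modules` entries. A small sketch of an order-insensitive comparison is shown here, on the assumption that the consumer treats the list as an unordered collection (PEFT's `LoraConfig` is understood to convert a list of target modules to a set, but that is not confirmed by this diff). The helper `same_target_modules` is a hypothetical name, and the two JSON strings are trimmed to the one field that changed.

```python
import json

# Only the changed field is reproduced; the rest of adapter_config.json is omitted.
old_config = json.loads('{"target_modules": ["q_proj", "v_proj"]}')
new_config = json.loads('{"target_modules": ["v_proj", "q_proj"]}')


def same_target_modules(a: dict, b: dict) -> bool:
    """Compare target_modules while ignoring list order."""
    return set(a["target_modules"]) == set(b["target_modules"])


print(same_target_modules(old_config, new_config))
```

Under that assumption, the reordering is a serialization difference and not a change to which layers the adapter wraps.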

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:eb2e9a65ef5bd74ebe0848928ba17d71be2c3befe9c327397b94a8cc77b0286a
+oid sha256:9eae570dc07c93f0a89858ad1ddadfa1466597b779d3798034c98087f01bd315
 size 27280152
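
The binary files in this commit appear only as Git LFS pointers: a `version` line, an `oid` holding a SHA-256 digest, and a `size` in bytes. For `adapter_model.safetensors` the size is unchanged while the digest differs, which is consistent with same-shaped adapter weights holding new values. The sketch below shows one way to check downloaded bytes against such a pointer; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if data has the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# Pointer text for the new adapter weights, copied from the diff.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:9eae570dc07c93f0a89858ad1ddadfa1466597b779d3798034c98087f01bd315\n"
    "size 27280152\n"
)
print(pointer["algo"], pointer["size"])
```

To use it, read the downloaded file as bytes and pass them to `matches_pointer` together with the parsed pointer.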

runs/Jun04_08-26-57_ca1fc2d8e9cc/events.out.tfevents.1717489683.ca1fc2d8e9cc.207.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:9faf06433c64a8741fb0a3bea787c252d9c721822796ce2e0dc2d4855c691343
+size 9105

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:599a2bb4233f440404a8cf9434ec257d86974abb6e9cd21c6f6bf9cbe680e110
+oid sha256:b6f231e99809c7209e33afc43c300a85993ec0dfad320ef774c3c841ba22afdb
 size 5112