sv469/phi-2_finetuned

Files changed (4) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1627
 ## Model description
@@ -44,18 +44,16 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
-- training_steps: 300
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss |
 |:-------------:|:-------:|:----:|:---------------:|
-| 0.92          | 4.6512  | 50   | 0.3772          |
-| 0.2534        | 9.3023  | 100  | 0.2253          |
-| 0.1327        | 13.9535 | 150  | 0.1858          |
-| 0.0795        | 18.6047 | 200  | 0.1676          |
-| 0.0601        | 23.2558 | 250  | 0.1641          |
-| 0.0483        | 27.9070 | 300  | 0.1627          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1810
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 2
+- training_steps: 200
 ### Training results
 | Training Loss | Epoch   | Step | Validation Loss |
 |:-------------:|:-------:|:----:|:---------------:|
+| 0.9443        | 4.6512  | 50   | 0.3824          |
+| 0.2665        | 9.3023  | 100  | 0.2402          |
+| 0.1542        | 13.9535 | 150  | 0.1915          |
+| 0.1077        | 18.6047 | 200  | 0.1810          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,10 +20,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "k_proj",
     "v_proj",
-    "dense"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "v_proj",
+    "k_proj",
+    "dense",
+    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:87917607e271e584fc3b86415fdfd0cab6ace7f7d2be189e9eb9f91ddf1967f0
 size 83920464

 version https://git-lfs.github.com/spec/v1
+oid sha256:f8398c55eddd79fdfae52a476cc61b5da86fc9356fa61d771456a6e45aafa8f8
 size 83920464

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f99b3219f11780e86b2f5401e3c4473722529a744a068013da9406cb4ef7d065
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:daad53ab6a86dec224473c87f14c517588246fbec08b222e8294552c4709945c
 size 5112