End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.2557
 ## Model description
@@ -41,14 +41,16 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.1404        | 1.0   | 263  | 4.2557          |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: nan
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.9762        | 1.0   | 263  | nan             |
+| 0.0           | 2.0   | 526  | nan             |
+| 0.0           | 3.0   | 789  | nan             |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,10 +20,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "k_proj",
-    "dense",
     "v_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "q_proj",
     "v_proj",
+    "k_proj",
+    "dense"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:399e63143ffdd82e6fc1603e1470ba334150846db4c5b301e82eac210c0d6c41
 size 41977360

 version https://git-lfs.github.com/spec/v1
+oid sha256:fe648d6e90d61490d841d4f211b4665d7cc8137d8d69fb0b6b9bf5594e5b254c
 size 41977360

runs/Jul18_09-07-51_cc50c4e10f20/events.out.tfevents.1721293672.cc50c4e10f20.35.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:669befa42739741b7c1a0fc1df5ed4ea301ed8cb4f4e6d879428fbc52ea52667
+size 14759

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:addfd9a532679b84b78eb2d8600b6a5d64051cbb2b453b7280c0686ba17fa3f5
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:2e0c734946e74e0274bbb97cf27033af6710dec7e7a5afb7daa565228230c575
 size 5112