End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -12,11 +12,12 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # peft_phi_2
 This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: nan
 ## Model description
@@ -48,15 +49,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.9762        | 1.0   | 263  | nan             |
-| 0.0           | 2.0   | 526  | nan             |
-| 0.0           | 3.0   | 789  | nan             |
 ### Framework versions
-- PEFT 0.11.1
-- Transformers 4.41.2
-- Pytorch 2.3.0+cu121
-- Datasets 2.20.0
 - Tokenizers 0.19.1

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/timmyafolami/huggingface/runs/w9qoroly)
 # peft_phi_2
 This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 8.4189
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 8.0528        | 1.0   | 263  | 7.7273          |
+| 8.4081        | 2.0   | 526  | 8.4002          |
+| 8.4519        | 3.0   | 789  | 8.4189          |
 ### Framework versions
+- PEFT 0.11.2.dev0
+- Transformers 4.42.4
+- Pytorch 2.1.2
+- Datasets 2.19.2
 - Tokenizers 0.19.1

adapter_config.json CHANGED Viewed

@@ -21,9 +21,9 @@
   "revision": null,
   "target_modules": [
     "q_proj",
     "v_proj",
-    "k_proj",
-    "dense"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "revision": null,
   "target_modules": [
     "q_proj",
+    "dense",
     "v_proj",
+    "k_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fe648d6e90d61490d841d4f211b4665d7cc8137d8d69fb0b6b9bf5594e5b254c
 size 41977360

 version https://git-lfs.github.com/spec/v1
+oid sha256:116d780c8804bff796c19aa92508c944234221658d7a27a2cd2b2effe68f775c
 size 41977360

runs/Jul19_10-00-18_02ac936770be/events.out.tfevents.1721383227.02ac936770be.34.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:221f2f5748b90e09059e76b3e4fd09dcb76e4a6597fb7d94b6b5e62dfb41151e
+size 14785

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2e0c734946e74e0274bbb97cf27033af6710dec7e7a5afb7daa565228230c575
-size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:33e4fc51715dbe01009bc97ab28fbddd62e79c1d37adc8ca8d21de7eab7b1b4f
+size 5176