Timmyafolami commited on
Commit
5c37b3a
1 Parent(s): f83c353

End of training

Browse files
README.md CHANGED
@@ -12,11 +12,12 @@ model-index:
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
 
15
  # peft_phi_2
16
 
17
  This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: nan
20
 
21
  ## Model description
22
 
@@ -48,15 +49,15 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | 4.9762 | 1.0 | 263 | nan |
52
- | 0.0 | 2.0 | 526 | nan |
53
- | 0.0 | 3.0 | 789 | nan |
54
 
55
 
56
  ### Framework versions
57
 
58
- - PEFT 0.11.1
59
- - Transformers 4.41.2
60
- - Pytorch 2.3.0+cu121
61
- - Datasets 2.20.0
62
  - Tokenizers 0.19.1
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/timmyafolami/huggingface/runs/w9qoroly)
16
  # peft_phi_2
17
 
18
  This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 8.4189
21
 
22
  ## Model description
23
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | 8.0528 | 1.0 | 263 | 7.7273 |
53
+ | 8.4081 | 2.0 | 526 | 8.4002 |
54
+ | 8.4519 | 3.0 | 789 | 8.4189 |
55
 
56
 
57
  ### Framework versions
58
 
59
+ - PEFT 0.11.2.dev0
60
+ - Transformers 4.42.4
61
+ - Pytorch 2.1.2
62
+ - Datasets 2.19.2
63
  - Tokenizers 0.19.1
adapter_config.json CHANGED
@@ -21,9 +21,9 @@
21
  "revision": null,
22
  "target_modules": [
23
  "q_proj",
 
24
  "v_proj",
25
- "k_proj",
26
- "dense"
27
  ],
28
  "task_type": "CAUSAL_LM",
29
  "use_dora": false,
 
21
  "revision": null,
22
  "target_modules": [
23
  "q_proj",
24
+ "dense",
25
  "v_proj",
26
+ "k_proj"
 
27
  ],
28
  "task_type": "CAUSAL_LM",
29
  "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fe648d6e90d61490d841d4f211b4665d7cc8137d8d69fb0b6b9bf5594e5b254c
3
  size 41977360
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:116d780c8804bff796c19aa92508c944234221658d7a27a2cd2b2effe68f775c
3
  size 41977360
runs/Jul19_10-00-18_02ac936770be/events.out.tfevents.1721383227.02ac936770be.34.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:221f2f5748b90e09059e76b3e4fd09dcb76e4a6597fb7d94b6b5e62dfb41151e
3
+ size 14785
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2e0c734946e74e0274bbb97cf27033af6710dec7e7a5afb7daa565228230c575
3
- size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:33e4fc51715dbe01009bc97ab28fbddd62e79c1d37adc8ca8d21de7eab7b1b4f
3
+ size 5176