Timmyafolami commited on
Commit
a2f483d
1 Parent(s): 5c37b3a

End of training

Browse files
README.md CHANGED
@@ -12,12 +12,12 @@ model-index:
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/timmyafolami/huggingface/runs/w9qoroly)
16
  # peft_phi_2
17
 
18
- This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on the None dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 8.4189
21
 
22
  ## Model description
23
 
@@ -49,9 +49,9 @@ The following hyperparameters were used during training:
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
- | 8.0528 | 1.0 | 263 | 7.7273 |
53
- | 8.4081 | 2.0 | 526 | 8.4002 |
54
- | 8.4519 | 3.0 | 789 | 8.4189 |
55
 
56
 
57
  ### Framework versions
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/timmyafolami/huggingface/runs/dza6a6tk)
16
  # peft_phi_2
17
 
18
+ This model is a fine-tuned version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: nan
21
 
22
  ## Model description
23
 
 
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | 6.4514 | 1.0 | 287 | 6.5988 |
53
+ | 0.0 | 2.0 | 574 | nan |
54
+ | 0.0 | 3.0 | 861 | nan |
55
 
56
 
57
  ### Framework versions
adapter_config.json CHANGED
@@ -20,9 +20,9 @@
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
- "q_proj",
24
- "dense",
25
  "v_proj",
 
 
26
  "k_proj"
27
  ],
28
  "task_type": "CAUSAL_LM",
 
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
 
 
23
  "v_proj",
24
+ "dense",
25
+ "q_proj",
26
  "k_proj"
27
  ],
28
  "task_type": "CAUSAL_LM",
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:116d780c8804bff796c19aa92508c944234221658d7a27a2cd2b2effe68f775c
3
  size 41977360
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c16b7a7c84a6fc270c5f0d40f161714d6ef9544de4d276bb275a757581b613aa
3
  size 41977360
runs/Jul23_13-33-06_7e6c8afec7bd/events.out.tfevents.1721741592.7e6c8afec7bd.34.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e18b801f33f032f2f9623697f091b1ae56de1303b4ed354d8bc8ba39092db4e6
3
+ size 15629
tokenizer.json CHANGED
@@ -2,14 +2,12 @@
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
- "max_length": 512,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
  "padding": {
10
- "strategy": {
11
- "Fixed": 512
12
- },
13
  "direction": "Right",
14
  "pad_to_multiple_of": null,
15
  "pad_id": 50256,
 
2
  "version": "1.0",
3
  "truncation": {
4
  "direction": "Right",
5
+ "max_length": 2048,
6
  "strategy": "LongestFirst",
7
  "stride": 0
8
  },
9
  "padding": {
10
+ "strategy": "BatchLongest",
 
 
11
  "direction": "Right",
12
  "pad_to_multiple_of": null,
13
  "pad_id": 50256,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:33e4fc51715dbe01009bc97ab28fbddd62e79c1d37adc8ca8d21de7eab7b1b4f
3
  size 5176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d3bcfe8268a59ca4611ee3edcb06a65f236d368c25248ed1e70d96028e4c3608
3
  size 5176