DeepDream2045 commited on
Commit
13e1290
1 Parent(s): d42ad43

End of training

Browse files
README.md CHANGED
@@ -104,7 +104,7 @@ xformers_attention: true
104
 
105
  This model is a fine-tuned version of [echarlaix/tiny-random-PhiForCausalLM](https://huggingface.co/echarlaix/tiny-random-PhiForCausalLM) on the None dataset.
106
  It achieves the following results on the evaluation set:
107
- - Loss: 6.9019
108
 
109
  ## Model description
110
 
@@ -141,9 +141,9 @@ The following hyperparameters were used during training:
141
 
142
  | Training Loss | Epoch | Step | Validation Loss |
143
  |:-------------:|:------:|:----:|:---------------:|
144
- | 6.9356 | 0.0471 | 1 | 6.9337 |
145
- | 6.7366 | 1.1765 | 25 | 6.9093 |
146
- | 6.9713 | 2.3529 | 50 | 6.9019 |
147
 
148
 
149
  ### Framework versions
 
104
 
105
  This model is a fine-tuned version of [echarlaix/tiny-random-PhiForCausalLM](https://huggingface.co/echarlaix/tiny-random-PhiForCausalLM) on the None dataset.
106
  It achieves the following results on the evaluation set:
107
+ - Loss: 6.8986
108
 
109
  ## Model description
110
 
 
141
 
142
  | Training Loss | Epoch | Step | Validation Loss |
143
  |:-------------:|:------:|:----:|:---------------:|
144
+ | 6.9356 | 0.0471 | 1 | 6.9338 |
145
+ | 6.7342 | 1.1765 | 25 | 6.9071 |
146
+ | 6.9684 | 2.3529 | 50 | 6.8986 |
147
 
148
 
149
  ### Framework versions
adapter_config.json CHANGED
@@ -20,11 +20,11 @@
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
- "dense",
24
- "fc2",
25
  "q_proj",
 
26
  "v_proj",
27
  "k_proj",
 
28
  "fc1"
29
  ],
30
  "task_type": "CAUSAL_LM",
 
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
 
 
23
  "q_proj",
24
+ "dense",
25
  "v_proj",
26
  "k_proj",
27
+ "fc2",
28
  "fc1"
29
  ],
30
  "task_type": "CAUSAL_LM",
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:faa77e0b8cea75f74a8074c62486126eae377e71ad1f99538c3588d95016ad7a
3
  size 243839
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:462549fa916bc791cb232ad5f78c36d471a3437f04aa829dce7e8901d64958e2
3
  size 243839
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:37fcd80437072c4a9d5d645a7515b0a3920a8019ed1ad18088bbb863fedeb6ea
3
  size 237402
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c7b5bcf2203e97f03c82650161ac06d9a4babefadb305abb360708258f5361cc
3
  size 237402
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ac40a2a8dba01c5dfec735379b0d6aaafc1124d9f51e774c8273efc8e6fc7714
3
  size 6776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f84aa723924298c428cfc079f9bc73e01baed2fb1275230b50fa001c00d4943
3
  size 6776