End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -105,7 +105,7 @@ xformers_attention: null
 This model is a fine-tuned version of [EleutherAI/pythia-1b](https://huggingface.co/EleutherAI/pythia-1b) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6513
 ## Model description
@@ -140,9 +140,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 26.2925       | 0.0022 | 1    | 6.3621          |
-| 31.4505       | 0.0065 | 3    | 6.0551          |
-| 24.3794       | 0.0130 | 6    | 4.7808          |
-| 17.7134       | 0.0196 | 9    | 3.6513          |
 ### Framework versions

 This model is a fine-tuned version of [EleutherAI/pythia-1b](https://huggingface.co/EleutherAI/pythia-1b) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.6329
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 26.2925       | 0.0022 | 1    | 6.3621          |
+| 31.458        | 0.0065 | 3    | 6.0527          |
+| 24.4518       | 0.0130 | 6    | 4.7729          |
+| 17.4177       | 0.0196 | 9    | 3.6329          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,10 +20,10 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "dense",
     "query_key_value",
-    "dense_4h_to_h",
-    "dense_h_to_4h"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "dense_h_to_4h",
     "query_key_value",
+    "dense",
+    "dense_4h_to_h"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f4f0d76be970bf2f086a9c154c6862f40882b7c43e1ee9100da508ad548e0ab6
 size 33601418

 version https://git-lfs.github.com/spec/v1
+oid sha256:e5c4e31bdc6eb971f3f287de0adbcee7bc937f71022a21c6ecf8a82af24e1f51
 size 33601418

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:57d10411134e4e8b7edda32ed065076e3056c883ffa784c0858322790cbad7b6
 size 33572288

 version https://git-lfs.github.com/spec/v1
+oid sha256:d45b8f2e0055ebb30f1397db12b068ac241a6c0352338292f386717953cc3b0d
 size 33572288

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7039ee5e93e39ae75f53f38b044442799d1e30cd7ff9a83e519c275615c657cc
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:b5626cd2613597d38b016ac503b35d867aeed1b5b31a1151465dc38b87f3fc7b
 size 6776