better
- README.md +1 -1
- config.json +1 -1
- model-00001-of-00003.safetensors +1 -1
- model-00002-of-00003.safetensors +1 -1
- model-00003-of-00003.safetensors +1 -1
README.md
CHANGED
@@ -34,7 +34,7 @@ and merged in
 
 ### 9-partition merge
 
-All of the layers were partitioned in to 9 random bins. Alternating models were slerped at [1...0] (inputs), and [0...1] (outputs) gradients; except attention, which was slerped at 0.
+All of the layers were partitioned in to 9 random bins. Alternating models were slerped at [1...0.5] (inputs), and [0.5...1] (outputs) gradients; except attention, which was slerped at 0.97 (with a drop rate of .28).
 
 This means that the model is still predominantly ordered around the base model - including half of the input and output layers, and 28% of attention.
 
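For readers unfamiliar with the "slerped at a gradient" wording in the README change above, here is a minimal sketch of spherical linear interpolation (slerp) between two weight tensors, with one interpolation factor per layer taken from a linear gradient. It is not the script used for this merge. The names `slerp`, `gradient`, `model_a_layers`, and `model_b_layers` are hypothetical, the random tensors are placeholders, and which end of the gradient corresponds to the base model is an assumption the README does not settle. The 9-bin partitioning and the attention drop rate are not modelled.

```python
import numpy as np


def slerp(t, w0, w1, eps=1e-8):
    """Spherically interpolate between two same-shaped weight tensors.

    t = 0 returns w0 and t = 1 returns w1.
    """
    a = np.asarray(w0, dtype=np.float64)
    b = np.asarray(w1, dtype=np.float64)
    # Angle between the two tensors, treated as flat vectors.
    a_unit = a.ravel() / (np.linalg.norm(a) + eps)
    b_unit = b.ravel() / (np.linalg.norm(b) + eps)
    theta = np.arccos(np.clip(np.dot(a_unit, b_unit), -1.0, 1.0))
    sin_theta = np.sin(theta)
    if sin_theta < 1e-6:
        # Nearly parallel or anti-parallel tensors: fall back to plain lerp.
        return (1.0 - t) * a + t * b
    return (np.sin((1.0 - t) * theta) * a + np.sin(t * theta) * b) / sin_theta


def gradient(start, end, num_layers):
    """One interpolation factor per layer, moving linearly from start to end."""
    return np.linspace(start, end, num_layers)


# Hypothetical 4-layer example using the input-side gradient [1...0.5].
rng = np.random.default_rng(0)
model_a_layers = [rng.normal(size=(3, 3)) for _ in range(4)]
model_b_layers = [rng.normal(size=(3, 3)) for _ in range(4)]
merged = [
    slerp(t, a, b)
    for t, a, b in zip(gradient(1.0, 0.5, 4), model_a_layers, model_b_layers)
]
```

Under this sketch's convention, moving the end of the gradient from 0 to 0.5 means the last layer in the range is an even blend of the two models rather than a copy of one of them.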
config.json
CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "OpenHermes-2.5-Mistral-7B",
+  "_name_or_path": "../../llama/raw/merge/math/OpenHermes-2.5-Mistral-7B",
   "architectures": [
     "MistralForCausalLM"
   ],
model-00001-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:f115c7c3446b2920cea1c486e7f33f27c7cd311a2c13addf26daf79f0a2ce84e
 size 4943178624
model-00002-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:c25c26b425b2c9a3ed0e5279610168e17ee2c828d4971ff6408b5ed3d2ce8a2d
 size 4999819232
model-00003-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:9b74362b58b5698b5b57ce87a3a84152ac55b22df0558fd1491c58e73afc08c5
 size 4540532640
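The three `.safetensors` entries above are Git LFS pointer files: only the `oid` changes, while `size` stays the same for each shard. As a rough sketch, a downloaded shard could be checked against its pointer like this. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS tooling, and the sketch assumes the three-line pointer layout shown in the diff.

```python
import hashlib
import io


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(stream, pointer, chunk_size=1 << 20):
    """Hash a binary stream in chunks and compare digest and size to the pointer."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        hasher.update(chunk)
        total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# Pointer text for model-00001-of-00003.safetensors as it appears after this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:f115c7c3446b2920cea1c486e7f33f27c7cd311a2c13addf26daf79f0a2ce84e\n"
    "size 4943178624\n"
)

# A tiny in-memory stand-in for a shard; it does not match the real pointer.
is_match = matches_pointer(io.BytesIO(b"not the real shard"), pointer)
```

For a real shard, pass a file opened in binary mode (`open(path, "rb")`) instead of the `io.BytesIO` stand-in, so the multi-gigabyte file is hashed in chunks rather than loaded into memory at once.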