perlthoughts
commited on
Commit
•
67ab971
1
Parent(s):
c7100ef
Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ This model was made by merging models based on Mistral with the SLERP merge meth
|
|
12 |
|
13 |
All model's weights were merged using the SLERP method. More information below.
|
14 |
|
15 |
-
Advantages of the SLERP
|
16 |
|
17 |
Spherical Linear Interpolation (SLERP)
|
18 |
Traditionally, model merging often resorts to weight averaging which, although straightforward, might not always capture the intricate features of the models being merged. The SLERP technique addresses this limitation, producing a blended model with characteristics smoothly interpolated from both parent models, ensuring the resultant model captures the essence of both its parents.
|
|
|
12 |
|
13 |
All model's weights were merged using the SLERP method. More information below.
|
14 |
|
15 |
+
Advantages of the SLERP vs averaging weights are as follows:
|
16 |
|
17 |
Spherical Linear Interpolation (SLERP)
|
18 |
Traditionally, model merging often resorts to weight averaging which, although straightforward, might not always capture the intricate features of the models being merged. The SLERP technique addresses this limitation, producing a blended model with characteristics smoothly interpolated from both parent models, ensuring the resultant model captures the essence of both its parents.
|