jsgreenawalt
commited on
Commit
•
76f2c99
1
Parent(s):
c1329d0
Update README.md
Browse files
README.md
CHANGED
@@ -11,12 +11,12 @@ tags:
|
|
11 |
---
|
12 |
# An experimental intermediate merge
|
13 |
|
14 |
-
This merge is intended as an intermediate merge for further merges. It's useable as-is, doesn't show any glaring signs of broken behavior. I've included a
|
15 |
|
16 |
The intuition behind this merge is as follows:
|
17 |
|
18 |
We keep the top 65 percent of weight deltas from the WPO-HB fine tune at a (very near) 1.0 weight
|
19 |
-
We 'flood fill' the remaining 45 percent of model weights with the SimPO weights. Because normalize is set to true, this results in a 1.0 weight on any non-overlapping weights
|
20 |
In cases of overlap with the top 65 weights from WPO-HB, the relative weight contribution for SimPO is near zero. In cases of non-overlap, each model gets a 1.0 or very near 1.0 weight for the merge.
|
21 |
|
22 |
Per the mergekit docs:
|
|
|
11 |
---
|
12 |
# An experimental intermediate merge
|
13 |
|
14 |
+
This merge is intended as an intermediate merge for further merges. It's useable as-is, and doesn't show any glaring signs of broken behavior. I've included a Q8_0 gguf in the repo if anyone is curious to try it.
|
15 |
|
16 |
The intuition behind this merge is as follows:
|
17 |
|
18 |
We keep the top 65 percent of weight deltas from the WPO-HB fine tune at a (very near) 1.0 weight
|
19 |
+
We 'flood fill' the remaining 45 percent of model weights with the SimPO weights. Because normalize is set to true, this results in a 1.0 weight from SimPO's deltas on any non-overlapping weights
|
20 |
In cases of overlap with the top 65 weights from WPO-HB, the relative weight contribution for SimPO is near zero. In cases of non-overlap, each model gets a 1.0 or very near 1.0 weight for the merge.
|
21 |
|
22 |
Per the mergekit docs:
|