jsgreenawalt/gemma-2-9B-it-wpo-simpo-della-test-1


jsgreenawalt committed on Aug 27

Commit 76f2c99 • 1 Parent(s): c1329d0

Update README.md

Files changed (1)

README.md +2 -2

@@ -11,12 +11,12 @@ tags:
 ---
 # An experimental intermediate merge
-This merge is intended as an intermediate merge for further merges. It's useable as-is, doesn't show any glaring signs of broken behavior. I've included a Q8 gguf in the repo if anyone's curious to try it.
 The intuition behind this merge is as follows:
 We keep the top 65 percent of weight deltas from the WPO-HB fine tune at a (very near) 1.0 weight
-We 'flood fill' the remaining 45 percent of model weights with the SimPO weights. Because normalize is set to true, this results in a 1.0 weight on any non-overlapping weights
 In cases of overlap with the top 65 weights from WPO-HB, the relative weight contribution for SimPO is near zero. In cases of non-overlap, each model gets a 1.0 or very near 1.0 weight for the merge.
 Per the mergekit docs:

 ---
 # An experimental intermediate merge
+This merge is intended as an intermediate merge for further merges. It's useable as-is, and doesn't show any glaring signs of broken behavior. I've included a Q8_0 gguf in the repo if anyone is curious to try it.
 The intuition behind this merge is as follows:
 We keep the top 65 percent of weight deltas from the WPO-HB fine tune at a (very near) 1.0 weight
+We 'flood fill' the remaining 45 percent of model weights with the SimPO weights. Because normalize is set to true, this results in a 1.0 weight from SimPO's deltas on any non-overlapping weights
 In cases of overlap with the top 65 weights from WPO-HB, the relative weight contribution for SimPO is near zero. In cases of non-overlap, each model gets a 1.0 or very near 1.0 weight for the merge.
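The weighting described above can be illustrated with a small sketch. This is not mergekit's actual code: it is a simplified, pure-Python model of weighted delta merging with normalization at a single parameter position, and it omits DELLA's magnitude-based dropping and rescaling. The function name `merge_deltas`, the weights `WPO_WEIGHT` and `SIMPO_WEIGHT`, and the delta values are all hypothetical and chosen only for illustration.

```python
def merge_deltas(deltas, weights, kept, normalize=True):
    """Combine per-model deltas at one parameter position.

    deltas  -- each model's delta from the base model at this position
    weights -- each model's merge weight
    kept    -- True where that model's delta survived pruning
    """
    survivors = [(d, w) for d, w, k in zip(deltas, weights, kept) if k]
    if not survivors:
        return 0.0  # no model contributes; the base weight is unchanged
    mixed = sum(d * w for d, w in survivors)
    if normalize:
        divisor = sum(w for _, w in survivors)
        if abs(divisor) > 1e-8:
            mixed /= divisor  # surviving weights are rescaled to sum to 1.0
    return mixed


# Hypothetical values, NOT taken from the actual merge config.
WPO_WEIGHT = 1.0     # assumed weight for the WPO-HB fine tune
SIMPO_WEIGHT = 0.01  # assumed small weight for SimPO
wpo_delta, simpo_delta = 0.5, -0.2
deltas = [wpo_delta, simpo_delta]
weights = [WPO_WEIGHT, SIMPO_WEIGHT]

# Overlap: both deltas survive, so SimPO's relative contribution is tiny.
both = merge_deltas(deltas, weights, [True, True])

# Non-overlap: only one delta survives, so normalization lifts it to 1.0.
simpo_only = merge_deltas(deltas, weights, [False, True])
wpo_only = merge_deltas(deltas, weights, [True, False])

print(both, simpo_only, wpo_only)
```

Under these assumed weights, the overlapping case stays close to the WPO-HB delta, while a position where only SimPO's delta survives receives that delta at effectively full strength, despite SimPO's small nominal weight.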
 Per the mergekit docs: