alchemonaut
/

BoreanGale-70B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

LilyWinter commited on Feb 2

Commit

14bdaac

•

1 Parent(s): 87792cc

Create README.md

Files changed (1) hide show

README.md +30 -0

README.md ADDED Viewed

	@@ -0,0 +1,30 @@

+---
+tags:
+- merge
+---
+# BoreanGale-70B
+A merge using a custom algorithm of:
+- [152334H/miqu-1-70b-sf](https://huggingface.co/152334H/miqu-1-70b-sf)
+- [Sao10K/WinterGoddess-1.4x-70B-L2](https://huggingface.co/Sao10K/WinterGoddess-1.4x-70B-L2)
+This merge retains most of the weights of Miqu, but when a weight is similar between the two, it is interpolated to the WinterGoddess value. A parameter *t* specifies the sameness threshold. When the distance between two values is below *t*, the weight from WinterGoddess is used.
+This version of the model uses *t* = 0.001. *t* was selected so that very few but some weights are fully switched to WinterGoddess. Model quality rapidly degrades above *t* = 0.0025:
+- *t* = 0.001: This model
+- *t* = 0.0025: Generates one paragraph okay, but then reverts to garbage
+- *t* = 0.005: Garbage; semi-related word lists
+- *t* = 0.01: Garbage; pseudorandom tokens output
+```
+    t: Union[float, np.ndarray],
+    v0: Union[np.ndarray, torch.Tensor],
+    v1: Union[np.ndarray, torch.Tensor],
+...
+    lweight = numpy.absolute(v0-v1)
+    lweight = t / lweight
+    lweight = numpy.nan_to_num(lweight, nan=1.0, posinf=1.0, neginf=1.0)
+    numpy.clip(lweight, a_min=0.0, a_max=1.0, out=lweight)
+    res = lerp(lweight,v0,v1)
+```