grimjim commited on
Commit
a38702b
1 Parent(s): da3b41b

Initial release

Browse files
.gitattributes CHANGED
@@ -4,6 +4,7 @@
4
  *.bz2 filter=lfs diff=lfs merge=lfs -text
5
  *.ckpt filter=lfs diff=lfs merge=lfs -text
6
  *.ftz filter=lfs diff=lfs merge=lfs -text
 
7
  *.gz filter=lfs diff=lfs merge=lfs -text
8
  *.h5 filter=lfs diff=lfs merge=lfs -text
9
  *.joblib filter=lfs diff=lfs merge=lfs -text
 
4
  *.bz2 filter=lfs diff=lfs merge=lfs -text
5
  *.ckpt filter=lfs diff=lfs merge=lfs -text
6
  *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gguf filter=lfs diff=lfs merge=lfs -text
8
  *.gz filter=lfs diff=lfs merge=lfs -text
9
  *.h5 filter=lfs diff=lfs merge=lfs -text
10
  *.joblib filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,54 @@
1
  ---
 
 
 
 
 
 
 
2
  license: cc-by-nc-4.0
 
3
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ base_model:
3
+ - grimjim/zephyr-wizard-kuno-royale-BF16-merge-7B
4
+ - grimjim/cuckoo-starling-7B
5
+ library_name: transformers
6
+ tags:
7
+ - mergekit
8
+ - merge
9
  license: cc-by-nc-4.0
10
+ pipeline_tag: text-generation
11
  ---
12
+ # rogue-enchantress-32k-7B-GGUF
13
+
14
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
15
+
16
+ An ambition of this merge was to augment text generation with the potential creative richness of the WizardLM-2 7B and Zephyr-7B-Beta models, the reasoning of the Starling-LM-7B-beta model, and extended context length of Mistral v0.2.
17
+
18
+ The resulting model is very attentive to character card descriptions and capable of applying reasoning. This model is in the smarter side, attentive to context and formatting. The model is creative and "wants" to write, incorporating details cooperatively with occasional runaway narration if it finds that the prompt leans that way.
19
+
20
+ Tested with ChatML Instruct prompts, temperature 1.0, and minP 0.02.
21
+
22
+ - Full weights: [grimjim/rogue-enchantress-32k-7B](https://huggingface.co/grimjim/rogue-enchantress-32k-7B)
23
+ - GGUF quants: [grimjim/rogue-enchantress-32k-7B-GGUF](https://huggingface.co/grimjim/rogue-enchantress-32k-7B-GGUF)
24
+
25
+ ## Merge Details
26
+ ### Merge Method
27
+
28
+ This model was merged using the SLERP merge method.
29
+
30
+ ### Models Merged
31
+
32
+ The following models were included in the merge:
33
+ * [grimjim/zephyr-wizard-kuno-royale-BF16-merge-7B](https://huggingface.co/grimjim/zephyr-wizard-kuno-royale-BF16-merge-7B)
34
+ * [grimjim/cuckoo-starling-7B](https://huggingface.co/grimjim/cuckoo-starling-7B)
35
+
36
+ ### Configuration
37
+
38
+ The following YAML configuration was used to produce this model:
39
+
40
+ ```yaml
41
+ slices:
42
+ - sources:
43
+ - model: grimjim/zephyr-wizard-kuno-royale-BF16-merge-7B
44
+ layer_range: [0,32]
45
+ - model: grimjim/cuckoo-starling-7B
46
+ layer_range: [0,32]
47
+ merge_method: slerp
48
+ base_model: grimjim/zephyr-wizard-kuno-royale-BF16-merge-7B
49
+ parameters:
50
+ t:
51
+ - value: 0.5
52
+ dtype: bfloat16
53
+
54
+ ```
rogue-enchantress-32k-7B.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d6fbf0bd1413e6e53a8e89cc3fca88eba07ab20175e3ffb52a03c74e7dff2a1e
3
+ size 4368439584
rogue-enchantress-32k-7B.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3956d556e9a6e3d8108432aac1aacaf4913a7342fe075b7ae4dac9afb3c836ef
3
+ size 5131409696
rogue-enchantress-32k-7B.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bdc80458b7fd8d44a23b7f777730953bcb57f7bfb9b0bded2f9e2b41af38ba18
3
+ size 5942065440
rogue-enchantress-32k-7B.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:449f106b8745cf924eae1b5c6d3ef520c4765c509ece617182bffa20e3fba057
3
+ size 7695857952