Undi95 committed on
Commit
4654715
1 Parent(s): f4b5a90

Update README.md

Files changed (1)
  1. README.md +64 -47
README.md CHANGED
@@ -1,51 +1,68 @@
  ---
- base_model:
- - NeverSleep/MiquMaid-70B-v3
- - NeverSleep/MiquMaid-v1-70B
- - migtissera/Tess-70B-v1.6
- - NeverSleep/MiquMaid-v2-70B
- - crestf411/daybreak-miqu-1-70b-v1.0-hf
- - sophosympatheia/Midnight-Miqu-70B-v1.0
- - 152334H/miqu-1-70b-sf
- library_name: transformers
  tags:
- - mergekit
  - merge
- license: cc-by-nc-2.0
  ---
- # out
-
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
-
- ## Merge Details
- ### Merge Method
-
- This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [152334H/miqu-1-70b-sf](https://huggingface.co/152334H/miqu-1-70b-sf) as a base.
-
- ### Models Merged
-
- The following models were included in the merge:
- * [NeverSleep/MiquMaid-70B-v3](https://huggingface.co/NeverSleep/MiquMaid-70B-v3)
- * [NeverSleep/MiquMaid-v1-70B](https://huggingface.co/NeverSleep/MiquMaid-v1-70B)
- * [migtissera/Tess-70B-v1.6](https://huggingface.co/migtissera/Tess-70B-v1.6)
- * [NeverSleep/MiquMaid-v2-70B](https://huggingface.co/NeverSleep/MiquMaid-v2-70B)
- * [crestf411/daybreak-miqu-1-70b-v1.0-hf](https://huggingface.co/crestf411/daybreak-miqu-1-70b-v1.0-hf)
- * [sophosympatheia/Midnight-Miqu-70B-v1.0](https://huggingface.co/sophosympatheia/Midnight-Miqu-70B-v1.0)
-
- ### Configuration
-
- The following YAML configuration was used to produce this model:
-
- ```yaml
- models:
- - model: 152334H/miqu-1-70b-sf
- - model: NeverSleep/MiquMaid-v1-70B
- - model: NeverSleep/MiquMaid-v2-70B
- - model: NeverSleep/MiquMaid-70B-v3
- - model: sophosympatheia/Midnight-Miqu-70B-v1.0
- - model: migtissera/Tess-70B-v1.6
- - model: crestf411/daybreak-miqu-1-70b-v1.0-hf
- merge_method: model_stock
- base_model: 152334H/miqu-1-70b-sf
- dtype: bfloat16
- ```
  ---
+ license: cc-by-nc-4.0
  tags:
+ - not-for-all-audiences
+ - nsfw
  - merge
  ---
+
+ ## MiquMaid v3
+
+ Check out our blogpost about this model series [Here!](https://ikaridevgit.github.io/index.html?blog=blogid-6&bo=true#Miqu-base) - Join our Discord server [Here!](https://discord.gg/Bb8pRUXy3Z)
+
+ <center><div style="width: 100%;">
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/63ab1241ad514ca8d1430003/4smLWpA55KgIbj4kH1zyF.png" style="display: block; margin: auto;">
+ </div></center>
+
+ This model uses the Alpaca **prompting format**.
+
+ The model was trained for RP conversation on Miqu-70B with our magic sauce. We then made an enormous merge containing all our old iterations of MiquMaid, plus some other Miqu-based RP models, using the new [Model Stock](https://arxiv.org/abs/2403.19522) merging method.
+
+ ## Credits:
+ - Undi
+ - IkariDev
+
+ ## Description
+
+ This repo contains FP16 files of MiquMaid-v3-70B.
+
+ Switch: [FP16](https://huggingface.co/NeverSleep/MiquMaid-v3-70B) - [GGUF](https://huggingface.co/NeverSleep/MiquMaid-v3-70B-GGUF)
+
+ ## Training data used:
+ - [Aesir datasets](https://huggingface.co/MinervaAI)
+ - [NoRobots](https://huggingface.co/datasets/Doctor-Shotgun/no-robots-sharegpt)
+ - [limarp](https://huggingface.co/datasets/lemonilia/LimaRP) - 8k ctx
+ - [toxic-dpo-v0.1-sharegpt](https://huggingface.co/datasets/Undi95/toxic-dpo-v0.1-sharegpt)
+ - [ToxicQAFinal](https://huggingface.co/datasets/NobodyExistsOnTheInternet/ToxicQAFinal)
+ - [Ikari's Dataset]
+ - [Squish42/bluemoon-fandom-1-1-rp-cleaned](https://huggingface.co/datasets/Squish42/bluemoon-fandom-1-1-rp-cleaned) - 50% (randomly)
+ - [NobodyExistsOnTheInternet/PIPPAsharegptv2test](https://huggingface.co/datasets/NobodyExistsOnTheInternet/PIPPAsharegptv2test) - 5% (randomly)
+ - [cgato/SlimOrcaDedupCleaned](https://huggingface.co/datasets/cgato/SlimOrcaDedupCleaned) - 5% (randomly)
+
+
+ ## Models used
+
+ - [NeverSleep/MiquMaid-70B-v3-Base](https://huggingface.co/NeverSleep/MiquMaid-70B-v3-Base) [Private finetune]
+ - [NeverSleep/MiquMaid-v2-70B](https://huggingface.co/NeverSleep/MiquMaid-v2-70B)
+ - [NeverSleep/MiquMaid-v1-70B](https://huggingface.co/NeverSleep/MiquMaid-v1-70B)
+ - [migtissera/Tess-70B-v1.6](https://huggingface.co/migtissera/Tess-70B-v1.6)
+ - [crestf411/daybreak-miqu-1-70b-v1.0-hf](https://huggingface.co/crestf411/daybreak-miqu-1-70b-v1.0-hf)
+ - [sophosympatheia/Midnight-Miqu-70B-v1.0](https://huggingface.co/sophosympatheia/Midnight-Miqu-70B-v1.0)
+
+ ### Custom format:
+ ```
+ ### Instruction:
+ {system prompt}
+ ### Input:
+ {input}
+ ### Response:
+ {reply}
+ ```
+
+ Mistral [INST][/INST] prompt format should work too.
+
+ ## Others
+
+ Undi: If you want to support us, you can [here](https://ko-fi.com/undiai).
+
+ IkariDev: Visit my [retro/neocities style website](https://ikaridevgit.github.io/) please kek
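
Note on the custom format added in this commit: the three-part template in the new README can be filled in with plain string formatting. The sketch below is only an illustration. The helper name `build_prompt` and its parameters are hypothetical and not part of the model repo, and the single-newline separators are an assumption copied from the template as it appears in the README.

```python
def build_prompt(system_prompt: str, user_input: str, reply: str = "") -> str:
    """Fill the README's Alpaca-style template.

    Hypothetical helper for illustration only; the newline layout mirrors
    the template in the README and is an assumption, not a verified spec.
    """
    return (
        "### Instruction:\n"
        f"{system_prompt}\n"
        "### Input:\n"
        f"{user_input}\n"
        "### Response:\n"
        f"{reply}"
    )


if __name__ == "__main__":
    # Leave `reply` empty so the model continues after "### Response:".
    print(build_prompt("You are a roleplay partner.", "Hello there!"))
```

Leaving `reply` empty produces a prompt that ends right after `### Response:`, which is the usual way to ask an Alpaca-style model for a completion.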