RichardErkhov committed on
Commit
75f98f2
1 Parent(s): 4aec54c

uploaded readme

Files changed (1)
  1. README.md +217 -0
README.md ADDED
@@ -0,0 +1,217 @@
1
+ Quantization made by Richard Erkhov.
2
+
3
+ [Github](https://github.com/RichardErkhov)
4
+
5
+ [Discord](https://discord.gg/pvy7H8DZMG)
6
+
7
+ [Request more models](https://github.com/RichardErkhov/quant_request)
8
+
9
+
10
+ FrankenVillain-7B-v1 - GGUF
11
+ - Model creator: https://huggingface.co/luqmanxyz/
12
+ - Original model: https://huggingface.co/luqmanxyz/FrankenVillain-7B-v1/
13
+
14
+
15
+ | Name | Quant method | Size |
16
+ | ---- | ---- | ---- |
17
+ | [FrankenVillain-7B-v1.Q2_K.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q2_K.gguf) | Q2_K | 3.73GB |
18
+ | [FrankenVillain-7B-v1.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.IQ3_XS.gguf) | IQ3_XS | 4.14GB |
19
+ | [FrankenVillain-7B-v1.IQ3_S.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.IQ3_S.gguf) | IQ3_S | 4.37GB |
20
+ | [FrankenVillain-7B-v1.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q3_K_S.gguf) | Q3_K_S | 4.34GB |
21
+ | [FrankenVillain-7B-v1.IQ3_M.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.IQ3_M.gguf) | IQ3_M | 4.51GB |
22
+ | [FrankenVillain-7B-v1.Q3_K.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q3_K.gguf) | Q3_K | 4.84GB |
23
+ | [FrankenVillain-7B-v1.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q3_K_M.gguf) | Q3_K_M | 4.84GB |
24
+ | [FrankenVillain-7B-v1.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q3_K_L.gguf) | Q3_K_L | 5.26GB |
25
+ | [FrankenVillain-7B-v1.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.IQ4_XS.gguf) | IQ4_XS | 5.43GB |
26
+ | [FrankenVillain-7B-v1.Q4_0.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q4_0.gguf) | Q4_0 | 5.66GB |
27
+ | [FrankenVillain-7B-v1.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.IQ4_NL.gguf) | IQ4_NL | 5.72GB |
28
+ | [FrankenVillain-7B-v1.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q4_K_S.gguf) | Q4_K_S | 5.7GB |
29
+ | [FrankenVillain-7B-v1.Q4_K.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q4_K.gguf) | Q4_K | 6.02GB |
30
+ | [FrankenVillain-7B-v1.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q4_K_M.gguf) | Q4_K_M | 6.02GB |
31
+ | [FrankenVillain-7B-v1.Q4_1.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q4_1.gguf) | Q4_1 | 6.27GB |
32
+ | [FrankenVillain-7B-v1.Q5_0.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q5_0.gguf) | Q5_0 | 6.89GB |
33
+ | [FrankenVillain-7B-v1.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q5_K_S.gguf) | Q5_K_S | 6.89GB |
34
+ | [FrankenVillain-7B-v1.Q5_K.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q5_K.gguf) | Q5_K | 7.08GB |
35
+ | [FrankenVillain-7B-v1.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q5_K_M.gguf) | Q5_K_M | 7.08GB |
36
+ | [FrankenVillain-7B-v1.Q5_1.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q5_1.gguf) | Q5_1 | 7.51GB |
37
+ | [FrankenVillain-7B-v1.Q6_K.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q6_K.gguf) | Q6_K | 8.2GB |
38
+ | [FrankenVillain-7B-v1.Q8_0.gguf](https://huggingface.co/RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf/blob/main/FrankenVillain-7B-v1.Q8_0.gguf) | Q8_0 | 10.62GB |
39
+
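The table above trades file size against quantization level. As a rough illustration, the sketch below picks the largest listed file that fits a size budget and builds its file name. The sizes are copied from the table (a subset only); `pick_quant` and `gguf_filename` are hypothetical helper names, and file size is only a lower bound on the memory needed at run time.

```python
# File sizes in GB, copied from the table above (subset, for brevity).
QUANT_SIZES_GB = {
    "Q2_K": 3.73,
    "Q3_K_M": 4.84,
    "Q4_K_M": 6.02,
    "Q5_K_M": 7.08,
    "Q6_K": 8.2,
    "Q8_0": 10.62,
}

# Repository name as it appears in the table's download links.
REPO_ID = "RichardErkhov/luqmanxyz_-_FrankenVillain-7B-v1-gguf"


def pick_quant(budget_gb: float) -> str:
    """Return the largest listed quant whose file size fits within budget_gb."""
    fitting = {q: s for q, s in QUANT_SIZES_GB.items() if s <= budget_gb}
    if not fitting:
        raise ValueError(f"no listed quant fits in {budget_gb} GB")
    return max(fitting, key=fitting.get)


def gguf_filename(quant: str) -> str:
    """Build the file name this repository uses for a given quant method."""
    return f"FrankenVillain-7B-v1.{quant}.gguf"


if __name__ == "__main__":
    quant = pick_quant(6.5)
    print(quant, gguf_filename(quant))
```

The resulting file name, together with `REPO_ID`, can be passed to `huggingface_hub.hf_hub_download(repo_id=..., filename=...)` to fetch the file.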
40
+
41
+
42
+
43
+ Original model description:
44
+ ---
+ license: apache-2.0
+ tags:
+ - merge
+ - mergekit
+ - jeonsworld/CarbonVillain-en-10.7B-v1
+ - jeonsworld/CarbonVillain-en-10.7B-v1
+ base_model:
+ - jeonsworld/CarbonVillain-en-10.7B-v1
+ - jeonsworld/CarbonVillain-en-10.7B-v1
+ model-index:
+ - name: FrankenVillain-7B-v1
+   results:
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: AI2 Reasoning Challenge (25-Shot)
+       type: ai2_arc
+       config: ARC-Challenge
+       split: test
+       args:
+         num_few_shot: 25
+     metrics:
+     - type: acc_norm
+       value: 42.75
+       name: normalized accuracy
+     source:
+       url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=luqmanxyz/FrankenVillain-7B-v1
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: HellaSwag (10-Shot)
+       type: hellaswag
+       split: validation
+       args:
+         num_few_shot: 10
+     metrics:
+     - type: acc_norm
+       value: 51.52
+       name: normalized accuracy
+     source:
+       url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=luqmanxyz/FrankenVillain-7B-v1
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: MMLU (5-Shot)
+       type: cais/mmlu
+       config: all
+       split: test
+       args:
+         num_few_shot: 5
+     metrics:
+     - type: acc
+       value: 48.6
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=luqmanxyz/FrankenVillain-7B-v1
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: TruthfulQA (0-shot)
+       type: truthful_qa
+       config: multiple_choice
+       split: validation
+       args:
+         num_few_shot: 0
+     metrics:
+     - type: mc2
+       value: 56.19
+     source:
+       url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=luqmanxyz/FrankenVillain-7B-v1
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: Winogrande (5-shot)
+       type: winogrande
+       config: winogrande_xl
+       split: validation
+       args:
+         num_few_shot: 5
+     metrics:
+     - type: acc
+       value: 73.01
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=luqmanxyz/FrankenVillain-7B-v1
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: GSM8k (5-shot)
+       type: gsm8k
+       config: main
+       split: test
+       args:
+         num_few_shot: 5
+     metrics:
+     - type: acc
+       value: 0.0
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=luqmanxyz/FrankenVillain-7B-v1
+       name: Open LLM Leaderboard
157
+ ---
158
+
159
+ # FrankenVillain-7B-v1
160
+
161
+ FrankenVillain-7B-v1 is a Franken merge of the following models using [mergekit](https://github.com/cg123/mergekit):
162
+ * [jeonsworld/CarbonVillain-en-10.7B-v1](https://huggingface.co/jeonsworld/CarbonVillain-en-10.7B-v1)
163
+ * [jeonsworld/CarbonVillain-en-10.7B-v1](https://huggingface.co/jeonsworld/CarbonVillain-en-10.7B-v1)
164
+
165
+ ## 🧩 Configuration
166
+
167
+ ```yaml
+ slices:
+   - sources:
+     - model: jeonsworld/CarbonVillain-en-10.7B-v1
+       layer_range: [0, 24]
+   - sources:
+     - model: jeonsworld/CarbonVillain-en-10.7B-v1
+       layer_range: [8, 32]
+ merge_method: passthrough
+ dtype: bfloat16
177
+ ```
178
+
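To make the configuration concrete, here is a minimal sketch of how the two slices stack under a passthrough merge. It assumes mergekit treats `layer_range` as a half-open interval `[start, end)` and concatenates slices in the order listed; `stacked_layers` is a hypothetical helper, not part of mergekit.

```python
def stacked_layers(slices):
    """Concatenate half-open [start, end) layer ranges in the order given."""
    layers = []
    for start, end in slices:
        layers.extend(range(start, end))
    return layers


# The two layer_range values from the configuration above.
layers = stacked_layers([(0, 24), (8, 32)])

# Source layers that end up in the merged model more than once.
duplicated = sorted({i for i in layers if layers.count(i) > 1})

if __name__ == "__main__":
    print(len(layers), duplicated[0], duplicated[-1])
```

Under these assumptions the merged model has 24 + 24 layers, and the overlap between the two ranges (source layers 8 through 23) is included twice.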
179
+ ## 💻 Usage
180
+
181
+ ```python
+ # Install dependencies first; in a notebook cell run:
+ #   !pip install -qU transformers accelerate
+
+ from transformers import AutoTokenizer
+ import transformers
+ import torch
+
+ model = "luqmanxyz/FrankenVillain-7B-v1"
+ messages = [{"role": "user", "content": "What are the 3 planets closest to the sun"}]
+
+ # Render the chat messages into a single prompt string using the model's chat template
+ tokenizer = AutoTokenizer.from_pretrained(model)
+ prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
+ pipeline = transformers.pipeline(
+     "text-generation",
+     model=model,
+     torch_dtype=torch.float16,
+     device_map="auto",
+ )
+
+ # Sample up to 256 new tokens and print the generated text
+ outputs = pipeline(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
+ print(outputs[0]["generated_text"])
202
+ ```
203
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
204
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_luqmanxyz__FrankenVillain-7B-v1)
205
+
206
+ | Metric |Value|
207
+ |---------------------------------|----:|
208
+ |Avg. |45.34|
209
+ |AI2 Reasoning Challenge (25-Shot)|42.75|
210
+ |HellaSwag (10-Shot) |51.52|
211
+ |MMLU (5-Shot) |48.60|
212
+ |TruthfulQA (0-shot) |56.19|
213
+ |Winogrande (5-shot) |73.01|
214
+ |GSM8k (5-shot) | 0.00|
215
+
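The "Avg." row appears to be the plain mean of the six benchmark scores. The sketch below recomputes it from the values in the table; treating the average as an unweighted mean is an assumption, not something the leaderboard text here states.

```python
# Benchmark scores copied from the table above.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 42.75,
    "HellaSwag (10-Shot)": 51.52,
    "MMLU (5-Shot)": 48.60,
    "TruthfulQA (0-shot)": 56.19,
    "Winogrande (5-shot)": 73.01,
    "GSM8k (5-shot)": 0.00,
}

# Unweighted mean across all six benchmarks (assumed averaging rule).
average = sum(scores.values()) / len(scores)

if __name__ == "__main__":
    print(f"{average:.3f}")
```

The GSM8k score of 0.00 pulls the mean down noticeably; the mean of the other five scores alone would be about 54.4.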
216
+
217
+