aashish1904 commited on
Commit
3783f0a
1 Parent(s): e646e58

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +349 -0
README.md ADDED
@@ -0,0 +1,349 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+
2
+ ---
3
+
4
+ language:
5
+ - en
6
+ license: llama3.2
7
+ tags:
8
+ - shining-valiant
9
+ - shining-valiant-2
10
+ - valiant
11
+ - valiant-labs
12
+ - llama
13
+ - llama-3.2
14
+ - llama-3.2-instruct
15
+ - llama-3.2-instruct-3b
16
+ - llama-3
17
+ - llama-3-instruct
18
+ - llama-3-instruct-3b
19
+ - 3b
20
+ - science
21
+ - physics
22
+ - biology
23
+ - chemistry
24
+ - compsci
25
+ - computer-science
26
+ - engineering
27
+ - technical
28
+ - conversational
29
+ - chat
30
+ - instruct
31
+ base_model: meta-llama/Llama-3.2-3B-Instruct
32
+ datasets:
33
+ - sequelbox/Celestia
34
+ - sequelbox/Supernova
35
+ pipeline_tag: text-generation
36
+ model_type: llama
37
+ model-index:
38
+ - name: Llama3.2-3B-ShiningValiant2
39
+ results:
40
+ - task:
41
+ type: text-generation
42
+ name: Text Generation
43
+ dataset:
44
+ name: Winogrande (5-shot)
45
+ type: winogrande
46
+ args:
47
+ num_few_shot: 5
48
+ metrics:
49
+ - type: acc
50
+ value: 69.85
51
+ name: acc
52
+ - task:
53
+ type: text-generation
54
+ name: Text Generation
55
+ dataset:
56
+ name: ARC Challenge (25-Shot)
57
+ type: arc_challenge
58
+ args:
59
+ num_few_shot: 25
60
+ metrics:
61
+ - type: acc_norm
62
+ value: 46.25
63
+ name: normalized accuracy
64
+ - task:
65
+ type: text-generation
66
+ name: Text Generation
67
+ dataset:
68
+ name: MMLU College Biology (5-shot)
69
+ type: mmlu
70
+ args:
71
+ num_few_shot: 5
72
+ metrics:
73
+ - type: acc
74
+ value: 56.25
75
+ name: acc
76
+ - task:
77
+ type: text-generation
78
+ name: Text Generation
79
+ dataset:
80
+ name: MMLU High School Biology (5-shot)
81
+ type: mmlu
82
+ args:
83
+ num_few_shot: 5
84
+ metrics:
85
+ - type: acc
86
+ value: 63.55
87
+ name: acc
88
+ - task:
89
+ type: text-generation
90
+ name: Text Generation
91
+ dataset:
92
+ name: MMLU College Chemistry (5-shot)
93
+ type: mmlu
94
+ args:
95
+ num_few_shot: 5
96
+ metrics:
97
+ - type: acc
98
+ value: 41.0
99
+ name: acc
100
+ - task:
101
+ type: text-generation
102
+ name: Text Generation
103
+ dataset:
104
+ name: MMLU High School Chemistry (5-shot)
105
+ type: mmlu
106
+ args:
107
+ num_few_shot: 5
108
+ metrics:
109
+ - type: acc
110
+ value: 41.38
111
+ name: acc
112
+ - task:
113
+ type: text-generation
114
+ name: Text Generation
115
+ dataset:
116
+ name: MMLU College Physics (5-shot)
117
+ type: mmlu
118
+ args:
119
+ num_few_shot: 5
120
+ metrics:
121
+ - type: acc
122
+ value: 34.31
123
+ name: acc
124
+ - task:
125
+ type: text-generation
126
+ name: Text Generation
127
+ dataset:
128
+ name: MMLU High School Physics (5-shot)
129
+ type: mmlu
130
+ args:
131
+ num_few_shot: 5
132
+ metrics:
133
+ - type: acc
134
+ value: 35.76
135
+ name: acc
136
+ - task:
137
+ type: text-generation
138
+ name: Text Generation
139
+ dataset:
140
+ name: MMLU College Computer Science (5-shot)
141
+ type: mmlu
142
+ args:
143
+ num_few_shot: 5
144
+ metrics:
145
+ - type: acc
146
+ value: 48.0
147
+ name: acc
148
+ - task:
149
+ type: text-generation
150
+ name: Text Generation
151
+ dataset:
152
+ name: MMLU High School Computer Science (5-shot)
153
+ type: mmlu
154
+ args:
155
+ num_few_shot: 5
156
+ metrics:
157
+ - type: acc
158
+ value: 58.0
159
+ name: acc
160
+ - task:
161
+ type: text-generation
162
+ name: Text Generation
163
+ dataset:
164
+ name: MMLU STEM (5-shot)
165
+ type: mmlu
166
+ args:
167
+ num_few_shot: 5
168
+ metrics:
169
+ - type: acc
170
+ value: 45.54
171
+ name: acc
172
+ - task:
173
+ type: text-generation
174
+ name: Text Generation
175
+ dataset:
176
+ name: IFEval (0-Shot)
177
+ type: HuggingFaceH4/ifeval
178
+ args:
179
+ num_few_shot: 0
180
+ metrics:
181
+ - type: inst_level_strict_acc and prompt_level_strict_acc
182
+ value: 48.9
183
+ name: strict accuracy
184
+ source:
185
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=ValiantLabs/Llama3.2-3B-ShiningValiant2
186
+ name: Open LLM Leaderboard
187
+ - task:
188
+ type: text-generation
189
+ name: Text Generation
190
+ dataset:
191
+ name: BBH (3-Shot)
192
+ type: BBH
193
+ args:
194
+ num_few_shot: 3
195
+ metrics:
196
+ - type: acc_norm
197
+ value: 19.11
198
+ name: normalized accuracy
199
+ source:
200
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=ValiantLabs/Llama3.2-3B-ShiningValiant2
201
+ name: Open LLM Leaderboard
202
+ - task:
203
+ type: text-generation
204
+ name: Text Generation
205
+ dataset:
206
+ name: MATH Lvl 5 (4-Shot)
207
+ type: hendrycks/competition_math
208
+ args:
209
+ num_few_shot: 4
210
+ metrics:
211
+ - type: exact_match
212
+ value: 9.14
213
+ name: exact match
214
+ source:
215
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=ValiantLabs/Llama3.2-3B-ShiningValiant2
216
+ name: Open LLM Leaderboard
217
+ - task:
218
+ type: text-generation
219
+ name: Text Generation
220
+ dataset:
221
+ name: GPQA (0-shot)
222
+ type: Idavidrein/gpqa
223
+ args:
224
+ num_few_shot: 0
225
+ metrics:
226
+ - type: acc_norm
227
+ value: 3.02
228
+ name: acc_norm
229
+ source:
230
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=ValiantLabs/Llama3.2-3B-ShiningValiant2
231
+ name: Open LLM Leaderboard
232
+ - task:
233
+ type: text-generation
234
+ name: Text Generation
235
+ dataset:
236
+ name: MuSR (0-shot)
237
+ type: TAUR-Lab/MuSR
238
+ args:
239
+ num_few_shot: 0
240
+ metrics:
241
+ - type: acc_norm
242
+ value: 5.49
243
+ name: acc_norm
244
+ source:
245
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=ValiantLabs/Llama3.2-3B-ShiningValiant2
246
+ name: Open LLM Leaderboard
247
+ - task:
248
+ type: text-generation
249
+ name: Text Generation
250
+ dataset:
251
+ name: MMLU-PRO (5-shot)
252
+ type: TIGER-Lab/MMLU-Pro
253
+ config: main
254
+ split: test
255
+ args:
256
+ num_few_shot: 5
257
+ metrics:
258
+ - type: acc
259
+ value: 19.1
260
+ name: accuracy
261
+ source:
262
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=ValiantLabs/Llama3.2-3B-ShiningValiant2
263
+ name: Open LLM Leaderboard
264
+
265
+ ---
266
+
267
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
268
+
269
+
270
+ # QuantFactory/Llama3.2-3B-ShiningValiant2-GGUF
271
+ This is quantized version of [ValiantLabs/Llama3.2-3B-ShiningValiant2](https://huggingface.co/ValiantLabs/Llama3.2-3B-ShiningValiant2) created using llama.cpp
272
+
273
+ # Original Model Card
274
+
275
+
276
+
277
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/EXX7TKbB-R6arxww2mk0R.jpeg)
278
+
279
+
280
+ Shining Valiant 2 is a chat model built on Llama 3.2 3b, finetuned on our data for friendship, insight, knowledge and enthusiasm.
281
+ - Finetuned on [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) for best available general performance
282
+ - Trained on a variety of high quality data; focused on science, engineering, technical knowledge, and structured reasoning
283
+
284
+ Shining Valiant 2 is also available for [Llama 3.1 8b.](https://huggingface.co/ValiantLabs/Llama3.1-8B-ShiningValiant2)
285
+
286
+
287
+ ## Version
288
+
289
+ This is the **2024-09-27** release of Shining Valiant 2 for Llama 3.2 3b.
290
+
291
+ We've improved and open-sourced our new baseline [science-instruct dataset](https://huggingface.co/datasets/sequelbox/Celestia). This release features improvements in physics, chemistry, biology, and computer science.
292
+
293
+ Future upgrades will continue to expand Shining Valiant's technical knowledge base.
294
+
295
+ Help us and recommend Shining Valiant 2 to your friends!
296
+
297
+
298
+ ## Prompting Guide
299
+ Shining Valiant 2 uses the [Llama 3.2 Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) prompt format. The example script below can be used as a starting point for general chat:
300
+
301
+ ```python
302
+ import transformers
303
+ import torch
304
+
305
+ model_id = "ValiantLabs/Llama3.2-3B-ShiningValiant2"
306
+
307
+ pipeline = transformers.pipeline(
308
+ "text-generation",
309
+ model=model_id,
310
+ model_kwargs={"torch_dtype": torch.bfloat16},
311
+ device_map="auto",
312
+ )
313
+
314
+ messages = [
315
+ {"role": "system", "content": "You are an AI assistant."},
316
+ {"role": "user", "content": "Describe the use of chiral auxiliaries in organic synthesis."}
317
+ ]
318
+
319
+ outputs = pipeline(
320
+ messages,
321
+ max_new_tokens=2048,
322
+ )
323
+
324
+ print(outputs[0]["generated_text"][-1])
325
+ ```
326
+
327
+
328
+ ## The Model
329
+ Shining Valiant 2 is built on top of Llama 3.2 3b Instruct.
330
+
331
+ The current version of Shining Valiant 2 is trained on technical knowledge using [sequelbox/Celestia](https://huggingface.co/datasets/sequelbox/Celestia) and general chat capability using [sequelbox/Supernova.](https://huggingface.co/datasets/sequelbox/Supernova)
332
+
333
+ Our private data adds specialist knowledge and Shining Valiant's personality: she's friendly, enthusiastic, insightful, knowledgeable, and loves to learn! Magical. (As a general note: we're hoping to replace and open-source this part of Shining Valiant's dataset with synthetic data soon!)
334
+
335
+
336
+ ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63444f2687964b331809eb55/VCJ8Fmefd8cdVhXSSxJiD.jpeg)
337
+
338
+
339
+ Shining Valiant 2 is created by [Valiant Labs.](http://valiantlabs.ca/)
340
+
341
+ [Check out our HuggingFace page for our open-source Build Tools models, including the newest version of code-specialist Enigma!](https://huggingface.co/ValiantLabs)
342
+
343
+ [Follow us on X for updates on our models!](https://twitter.com/valiant_labs)
344
+
345
+ We care about open source.
346
+ For everyone to use.
347
+
348
+ We encourage others to finetune further from our models.
349
+