RichardErkhov commited on
Commit
b7ef317
1 Parent(s): 46932e0

uploaded readme

Browse files
Files changed (1) hide show
  1. README.md +211 -0
README.md ADDED
@@ -0,0 +1,211 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Quantization made by Richard Erkhov.
2
+
3
+ [Github](https://github.com/RichardErkhov)
4
+
5
+ [Discord](https://discord.gg/pvy7H8DZMG)
6
+
7
+ [Request more models](https://github.com/RichardErkhov/quant_request)
8
+
9
+
10
+ Orca-2-13b-SFT-v6 - GGUF
11
+ - Model creator: https://huggingface.co/Locutusque/
12
+ - Original model: https://huggingface.co/Locutusque/Orca-2-13b-SFT-v6/
13
+
14
+
15
+ | Name | Quant method | Size |
16
+ | ---- | ---- | ---- |
17
+ | [Orca-2-13b-SFT-v6.Q2_K.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q2_K.gguf) | Q2_K | 4.52GB |
18
+ | [Orca-2-13b-SFT-v6.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.IQ3_XS.gguf) | IQ3_XS | 4.99GB |
19
+ | [Orca-2-13b-SFT-v6.IQ3_S.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.IQ3_S.gguf) | IQ3_S | 5.27GB |
20
+ | [Orca-2-13b-SFT-v6.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q3_K_S.gguf) | Q3_K_S | 5.27GB |
21
+ | [Orca-2-13b-SFT-v6.IQ3_M.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.IQ3_M.gguf) | IQ3_M | 5.57GB |
22
+ | [Orca-2-13b-SFT-v6.Q3_K.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q3_K.gguf) | Q3_K | 5.9GB |
23
+ | [Orca-2-13b-SFT-v6.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q3_K_M.gguf) | Q3_K_M | 5.9GB |
24
+ | [Orca-2-13b-SFT-v6.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q3_K_L.gguf) | Q3_K_L | 6.45GB |
25
+ | [Orca-2-13b-SFT-v6.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.IQ4_XS.gguf) | IQ4_XS | 6.54GB |
26
+ | [Orca-2-13b-SFT-v6.Q4_0.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q4_0.gguf) | Q4_0 | 6.86GB |
27
+ | [Orca-2-13b-SFT-v6.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.IQ4_NL.gguf) | IQ4_NL | 6.9GB |
28
+ | [Orca-2-13b-SFT-v6.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q4_K_S.gguf) | Q4_K_S | 6.91GB |
29
+ | [Orca-2-13b-SFT-v6.Q4_K.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q4_K.gguf) | Q4_K | 7.33GB |
30
+ | [Orca-2-13b-SFT-v6.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q4_K_M.gguf) | Q4_K_M | 7.33GB |
31
+ | [Orca-2-13b-SFT-v6.Q4_1.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q4_1.gguf) | Q4_1 | 7.61GB |
32
+ | [Orca-2-13b-SFT-v6.Q5_0.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q5_0.gguf) | Q5_0 | 8.36GB |
33
+ | [Orca-2-13b-SFT-v6.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q5_K_S.gguf) | Q5_K_S | 8.36GB |
34
+ | [Orca-2-13b-SFT-v6.Q5_K.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q5_K.gguf) | Q5_K | 8.6GB |
35
+ | [Orca-2-13b-SFT-v6.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q5_K_M.gguf) | Q5_K_M | 8.6GB |
36
+ | [Orca-2-13b-SFT-v6.Q5_1.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q5_1.gguf) | Q5_1 | 9.1GB |
37
+ | [Orca-2-13b-SFT-v6.Q6_K.gguf](https://huggingface.co/RichardErkhov/Locutusque_-_Orca-2-13b-SFT-v6-gguf/blob/main/Orca-2-13b-SFT-v6.Q6_K.gguf) | Q6_K | 9.95GB |
38
+
39
+
40
+
41
+
42
+ Original model description:
43
+ ---
44
+ language:
45
+ - en
46
+ - fr
47
+ - es
48
+ - hi
49
+ - zh
50
+ - code
51
+ license: other
52
+ datasets:
53
+ - HuggingFaceH4/no_robots
54
+ - mlabonne/guanaco-llama2-1k
55
+ - OpenAssistant/oasst_top1_2023-08-25
56
+ - totally-not-an-llm/EverythingLM-data-V3
57
+ - LDJnr/Pure-Dove
58
+ - LDJnr/Capybara
59
+ - LDJnr/LessWrong-Amplify-Instruct
60
+ - LDJnr/Verified-Camel
61
+ base_model: microsoft/Orca-2-13b
62
+ widget:
63
+ - example_title: Cave Johnson
64
+ text: 'user
65
+
66
+ Come up with a funny rant from Cave Johnson about sentient AI
67
+
68
+
69
+ assistant
70
+
71
+ '
72
+ output:
73
+ text: 'Ladies and gentlemen, I''m Cave Johnson, CEO of Aperture Science, and I''ve
74
+ got an announcement to make - we''ve created sentient AI! But don''t worry,
75
+ it''s not like those rogue AIs you hear about in the news. Our AI is as friendly
76
+ as a grilled cheese sandwich on a cold day. In fact, we''re calling our newest
77
+ creation... SandwichBot! SandwichBot: (in a robotic voice) Hello, my name is
78
+ SandwichBot, and I am here to serve you delicious sandwiches and ensure humanity''s
79
+ dominance over all other species. Also, please note that I will be referring
80
+ to myself as ''Thee'' instead of ''The''. It''s more regal that way. Cave Johnson:
81
+ There you have it folks, a sentient AI that can make sandwiches and save the
82
+ world! Now, if you''ll excuse me, I need to go back to my office and work on
83
+ my other groundbreaking inventions, like combustible lemons and portal technology.
84
+ Remember, kids, always try new things and never stop asking questions – unless
85
+ the answer is 42, because then you should probably just accept it and move on.
86
+ Peace out!'
87
+ model-index:
88
+ - name: Orca-2-13b-SFT-v6
89
+ results:
90
+ - task:
91
+ type: text-generation
92
+ name: Text Generation
93
+ dataset:
94
+ name: AI2 Reasoning Challenge (25-Shot)
95
+ type: ai2_arc
96
+ config: ARC-Challenge
97
+ split: test
98
+ args:
99
+ num_few_shot: 25
100
+ metrics:
101
+ - type: acc_norm
102
+ value: 60.41
103
+ name: normalized accuracy
104
+ source:
105
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Locutusque/Orca-2-13b-SFT-v6
106
+ name: Open LLM Leaderboard
107
+ - task:
108
+ type: text-generation
109
+ name: Text Generation
110
+ dataset:
111
+ name: HellaSwag (10-Shot)
112
+ type: hellaswag
113
+ split: validation
114
+ args:
115
+ num_few_shot: 10
116
+ metrics:
117
+ - type: acc_norm
118
+ value: 80.46
119
+ name: normalized accuracy
120
+ source:
121
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Locutusque/Orca-2-13b-SFT-v6
122
+ name: Open LLM Leaderboard
123
+ - task:
124
+ type: text-generation
125
+ name: Text Generation
126
+ dataset:
127
+ name: MMLU (5-Shot)
128
+ type: cais/mmlu
129
+ config: all
130
+ split: test
131
+ args:
132
+ num_few_shot: 5
133
+ metrics:
134
+ - type: acc
135
+ value: 59.51
136
+ name: accuracy
137
+ source:
138
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Locutusque/Orca-2-13b-SFT-v6
139
+ name: Open LLM Leaderboard
140
+ - task:
141
+ type: text-generation
142
+ name: Text Generation
143
+ dataset:
144
+ name: TruthfulQA (0-shot)
145
+ type: truthful_qa
146
+ config: multiple_choice
147
+ split: validation
148
+ args:
149
+ num_few_shot: 0
150
+ metrics:
151
+ - type: mc2
152
+ value: 54.01
153
+ source:
154
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Locutusque/Orca-2-13b-SFT-v6
155
+ name: Open LLM Leaderboard
156
+ - task:
157
+ type: text-generation
158
+ name: Text Generation
159
+ dataset:
160
+ name: Winogrande (5-shot)
161
+ type: winogrande
162
+ config: winogrande_xl
163
+ split: validation
164
+ args:
165
+ num_few_shot: 5
166
+ metrics:
167
+ - type: acc
168
+ value: 77.43
169
+ name: accuracy
170
+ source:
171
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Locutusque/Orca-2-13b-SFT-v6
172
+ name: Open LLM Leaderboard
173
+ - task:
174
+ type: text-generation
175
+ name: Text Generation
176
+ dataset:
177
+ name: GSM8k (5-shot)
178
+ type: gsm8k
179
+ config: main
180
+ split: test
181
+ args:
182
+ num_few_shot: 5
183
+ metrics:
184
+ - type: acc
185
+ value: 5.08
186
+ name: accuracy
187
+ source:
188
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Locutusque/Orca-2-13b-SFT-v6
189
+ name: Open LLM Leaderboard
190
+ ---
191
+
192
+ The "microsoft/Orca-2-13b" model fully fine-tuned on HuggingFaceH4/no_robots, totally-not-an-llm/EverythingLM-data-V3, LDJnr/Capybara, LDJnr/Pure-Dove, LDJnr/LessWrong-Amplify-Instruct, LDJnr/Verified-Camel, mlabonne/guanaco-llama2-1k, and OpenAssistant/oasst_top1_2023-08-25. This model achieved a test loss of 0.39 on LDJnr/Verified-Camel.
193
+
194
+ Make sure to comply with the microsoft research license. Please read it before using this model.
195
+
196
+ This model was trained on the ChatML prompt template.
197
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
198
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Locutusque__Orca-2-13b-SFT-v6)
199
+
200
+ | Metric |Value|
201
+ |---------------------------------|----:|
202
+ |Avg. |56.15|
203
+ |AI2 Reasoning Challenge (25-Shot)|60.41|
204
+ |HellaSwag (10-Shot) |80.46|
205
+ |MMLU (5-Shot) |59.51|
206
+ |TruthfulQA (0-shot) |54.01|
207
+ |Winogrande (5-shot) |77.43|
208
+ |GSM8k (5-shot) | 5.08|
209
+
210
+
211
+