Nabokov commited on
Commit
bf6b439
1 Parent(s): b454b39

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +263 -0
README.md ADDED
@@ -0,0 +1,263 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ license: mit
5
+ library_name: transformers
6
+ tags:
7
+ - axolotl
8
+ - finetune
9
+ - dpo
10
+ - microsoft
11
+ - phi
12
+ - pytorch
13
+ - phi-3
14
+ - nlp
15
+ - code
16
+ - chatml
17
+ - llama-cpp
18
+ - gguf-my-repo
19
+ base_model: MaziyarPanahi/calme-2.1-phi3-4b
20
+ pipeline_tag: text-generation
21
+ inference: false
22
+ model_creator: MaziyarPanahi
23
+ quantized_by: MaziyarPanahi
24
+ model-index:
25
+ - name: calme-2.1-phi3-4b
26
+ results:
27
+ - task:
28
+ type: text-generation
29
+ name: Text Generation
30
+ dataset:
31
+ name: AI2 Reasoning Challenge (25-Shot)
32
+ type: ai2_arc
33
+ config: ARC-Challenge
34
+ split: test
35
+ args:
36
+ num_few_shot: 25
37
+ metrics:
38
+ - type: acc_norm
39
+ value: 62.63
40
+ name: normalized accuracy
41
+ source:
42
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
43
+ name: Open LLM Leaderboard
44
+ - task:
45
+ type: text-generation
46
+ name: Text Generation
47
+ dataset:
48
+ name: HellaSwag (10-Shot)
49
+ type: hellaswag
50
+ split: validation
51
+ args:
52
+ num_few_shot: 10
53
+ metrics:
54
+ - type: acc_norm
55
+ value: 81.07
56
+ name: normalized accuracy
57
+ source:
58
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
59
+ name: Open LLM Leaderboard
60
+ - task:
61
+ type: text-generation
62
+ name: Text Generation
63
+ dataset:
64
+ name: MMLU (5-Shot)
65
+ type: cais/mmlu
66
+ config: all
67
+ split: test
68
+ args:
69
+ num_few_shot: 5
70
+ metrics:
71
+ - type: acc
72
+ value: 68.96
73
+ name: accuracy
74
+ source:
75
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
76
+ name: Open LLM Leaderboard
77
+ - task:
78
+ type: text-generation
79
+ name: Text Generation
80
+ dataset:
81
+ name: TruthfulQA (0-shot)
82
+ type: truthful_qa
83
+ config: multiple_choice
84
+ split: validation
85
+ args:
86
+ num_few_shot: 0
87
+ metrics:
88
+ - type: mc2
89
+ value: 61.48
90
+ source:
91
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
92
+ name: Open LLM Leaderboard
93
+ - task:
94
+ type: text-generation
95
+ name: Text Generation
96
+ dataset:
97
+ name: Winogrande (5-shot)
98
+ type: winogrande
99
+ config: winogrande_xl
100
+ split: validation
101
+ args:
102
+ num_few_shot: 5
103
+ metrics:
104
+ - type: acc
105
+ value: 71.03
106
+ name: accuracy
107
+ source:
108
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
109
+ name: Open LLM Leaderboard
110
+ - task:
111
+ type: text-generation
112
+ name: Text Generation
113
+ dataset:
114
+ name: GSM8k (5-shot)
115
+ type: gsm8k
116
+ config: main
117
+ split: test
118
+ args:
119
+ num_few_shot: 5
120
+ metrics:
121
+ - type: acc
122
+ value: 72.25
123
+ name: accuracy
124
+ source:
125
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
126
+ name: Open LLM Leaderboard
127
+ - task:
128
+ type: text-generation
129
+ name: Text Generation
130
+ dataset:
131
+ name: IFEval (0-Shot)
132
+ type: HuggingFaceH4/ifeval
133
+ args:
134
+ num_few_shot: 0
135
+ metrics:
136
+ - type: inst_level_strict_acc and prompt_level_strict_acc
137
+ value: 55.25
138
+ name: strict accuracy
139
+ source:
140
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
141
+ name: Open LLM Leaderboard
142
+ - task:
143
+ type: text-generation
144
+ name: Text Generation
145
+ dataset:
146
+ name: BBH (3-Shot)
147
+ type: BBH
148
+ args:
149
+ num_few_shot: 3
150
+ metrics:
151
+ - type: acc_norm
152
+ value: 38.12
153
+ name: normalized accuracy
154
+ source:
155
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
156
+ name: Open LLM Leaderboard
157
+ - task:
158
+ type: text-generation
159
+ name: Text Generation
160
+ dataset:
161
+ name: MATH Lvl 5 (4-Shot)
162
+ type: hendrycks/competition_math
163
+ args:
164
+ num_few_shot: 4
165
+ metrics:
166
+ - type: exact_match
167
+ value: 4.31
168
+ name: exact match
169
+ source:
170
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
171
+ name: Open LLM Leaderboard
172
+ - task:
173
+ type: text-generation
174
+ name: Text Generation
175
+ dataset:
176
+ name: GPQA (0-shot)
177
+ type: Idavidrein/gpqa
178
+ args:
179
+ num_few_shot: 0
180
+ metrics:
181
+ - type: acc_norm
182
+ value: 10.63
183
+ name: acc_norm
184
+ source:
185
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
186
+ name: Open LLM Leaderboard
187
+ - task:
188
+ type: text-generation
189
+ name: Text Generation
190
+ dataset:
191
+ name: MuSR (0-shot)
192
+ type: TAUR-Lab/MuSR
193
+ args:
194
+ num_few_shot: 0
195
+ metrics:
196
+ - type: acc_norm
197
+ value: 8.26
198
+ name: acc_norm
199
+ source:
200
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
201
+ name: Open LLM Leaderboard
202
+ - task:
203
+ type: text-generation
204
+ name: Text Generation
205
+ dataset:
206
+ name: MMLU-PRO (5-shot)
207
+ type: TIGER-Lab/MMLU-Pro
208
+ config: main
209
+ split: test
210
+ args:
211
+ num_few_shot: 5
212
+ metrics:
213
+ - type: acc
214
+ value: 30.51
215
+ name: accuracy
216
+ source:
217
+ url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=MaziyarPanahi/calme-2.1-phi3-4b
218
+ name: Open LLM Leaderboard
219
+ ---
220
+
221
+ # Nabokov/calme-2.1-phi3-4b-Q8_0-GGUF
222
+ This model was converted to GGUF format from [`MaziyarPanahi/calme-2.1-phi3-4b`](https://huggingface.co/MaziyarPanahi/calme-2.1-phi3-4b) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
223
+ Refer to the [original model card](https://huggingface.co/MaziyarPanahi/calme-2.1-phi3-4b) for more details on the model.
224
+
225
+ ## Use with llama.cpp
226
+ Install llama.cpp through brew (works on Mac and Linux)
227
+
228
+ ```bash
229
+ brew install llama.cpp
230
+
231
+ ```
232
+ Invoke the llama.cpp server or the CLI.
233
+
234
+ ### CLI:
235
+ ```bash
236
+ llama-cli --hf-repo Nabokov/calme-2.1-phi3-4b-Q8_0-GGUF --hf-file calme-2.1-phi3-4b-q8_0.gguf -p "The meaning to life and the universe is"
237
+ ```
238
+
239
+ ### Server:
240
+ ```bash
241
+ llama-server --hf-repo Nabokov/calme-2.1-phi3-4b-Q8_0-GGUF --hf-file calme-2.1-phi3-4b-q8_0.gguf -c 2048
242
+ ```
243
+
244
+ Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
245
+
246
+ Step 1: Clone llama.cpp from GitHub.
247
+ ```
248
+ git clone https://github.com/ggerganov/llama.cpp
249
+ ```
250
+
251
+ Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
252
+ ```
253
+ cd llama.cpp && LLAMA_CURL=1 make
254
+ ```
255
+
256
+ Step 3: Run inference through the main binary.
257
+ ```
258
+ ./llama-cli --hf-repo Nabokov/calme-2.1-phi3-4b-Q8_0-GGUF --hf-file calme-2.1-phi3-4b-q8_0.gguf -p "The meaning to life and the universe is"
259
+ ```
260
+ or
261
+ ```
262
+ ./llama-server --hf-repo Nabokov/calme-2.1-phi3-4b-Q8_0-GGUF --hf-file calme-2.1-phi3-4b-q8_0.gguf -c 2048
263
+ ```