RichardErkhov commited on
Commit
1570f8f
1 Parent(s): 8c9259a

uploaded readme

Browse files
Files changed (1) hide show
  1. README.md +263 -0
README.md ADDED
@@ -0,0 +1,263 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Quantization made by Richard Erkhov.
2
+
3
+ [Github](https://github.com/RichardErkhov)
4
+
5
+ [Discord](https://discord.gg/pvy7H8DZMG)
6
+
7
+ [Request more models](https://github.com/RichardErkhov/quant_request)
8
+
9
+
10
+ Llama-3-8B-Dolphin-Portuguese - GGUF
11
+ - Model creator: https://huggingface.co/adalbertojunior/
12
+ - Original model: https://huggingface.co/adalbertojunior/Llama-3-8B-Dolphin-Portuguese/
13
+
14
+
15
+ | Name | Quant method | Size |
16
+ | ---- | ---- | ---- |
17
+ | [Llama-3-8B-Dolphin-Portuguese.Q2_K.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q2_K.gguf) | Q2_K | 2.96GB |
18
+ | [Llama-3-8B-Dolphin-Portuguese.IQ3_XS.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.IQ3_XS.gguf) | IQ3_XS | 3.28GB |
19
+ | [Llama-3-8B-Dolphin-Portuguese.IQ3_S.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.IQ3_S.gguf) | IQ3_S | 3.43GB |
20
+ | [Llama-3-8B-Dolphin-Portuguese.Q3_K_S.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q3_K_S.gguf) | Q3_K_S | 3.41GB |
21
+ | [Llama-3-8B-Dolphin-Portuguese.IQ3_M.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.IQ3_M.gguf) | IQ3_M | 3.52GB |
22
+ | [Llama-3-8B-Dolphin-Portuguese.Q3_K.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q3_K.gguf) | Q3_K | 3.74GB |
23
+ | [Llama-3-8B-Dolphin-Portuguese.Q3_K_M.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q3_K_M.gguf) | Q3_K_M | 3.74GB |
24
+ | [Llama-3-8B-Dolphin-Portuguese.Q3_K_L.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q3_K_L.gguf) | Q3_K_L | 4.03GB |
25
+ | [Llama-3-8B-Dolphin-Portuguese.IQ4_XS.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.IQ4_XS.gguf) | IQ4_XS | 4.18GB |
26
+ | [Llama-3-8B-Dolphin-Portuguese.Q4_0.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q4_0.gguf) | Q4_0 | 4.34GB |
27
+ | [Llama-3-8B-Dolphin-Portuguese.IQ4_NL.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.IQ4_NL.gguf) | IQ4_NL | 4.38GB |
28
+ | [Llama-3-8B-Dolphin-Portuguese.Q4_K_S.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q4_K_S.gguf) | Q4_K_S | 4.37GB |
29
+ | [Llama-3-8B-Dolphin-Portuguese.Q4_K.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q4_K.gguf) | Q4_K | 4.58GB |
30
+ | [Llama-3-8B-Dolphin-Portuguese.Q4_K_M.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q4_K_M.gguf) | Q4_K_M | 4.58GB |
31
+ | [Llama-3-8B-Dolphin-Portuguese.Q4_1.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q4_1.gguf) | Q4_1 | 4.78GB |
32
+ | [Llama-3-8B-Dolphin-Portuguese.Q5_0.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q5_0.gguf) | Q5_0 | 5.21GB |
33
+ | [Llama-3-8B-Dolphin-Portuguese.Q5_K_S.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q5_K_S.gguf) | Q5_K_S | 5.21GB |
34
+ | [Llama-3-8B-Dolphin-Portuguese.Q5_K.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q5_K.gguf) | Q5_K | 5.34GB |
35
+ | [Llama-3-8B-Dolphin-Portuguese.Q5_K_M.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q5_K_M.gguf) | Q5_K_M | 5.34GB |
36
+ | [Llama-3-8B-Dolphin-Portuguese.Q5_1.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q5_1.gguf) | Q5_1 | 5.65GB |
37
+ | [Llama-3-8B-Dolphin-Portuguese.Q6_K.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q6_K.gguf) | Q6_K | 6.14GB |
38
+ | [Llama-3-8B-Dolphin-Portuguese.Q8_0.gguf](https://huggingface.co/RichardErkhov/adalbertojunior_-_Llama-3-8B-Dolphin-Portuguese-gguf/blob/main/Llama-3-8B-Dolphin-Portuguese.Q8_0.gguf) | Q8_0 | 7.95GB |
39
+
40
+
41
+
42
+
43
+ Original model description:
44
+
45
+ ---
46
+ library_name: transformers
47
+ datasets:
48
+ - adalbertojunior/dolphin_pt_test
49
+ language:
50
+ - pt
51
+ model-index:
52
+ - name: Llama-3-8B-Dolphin-Portuguese
53
+ results:
54
+ - task:
55
+ type: text-generation
56
+ name: Text Generation
57
+ dataset:
58
+ name: ENEM Challenge (No Images)
59
+ type: eduagarcia/enem_challenge
60
+ split: train
61
+ args:
62
+ num_few_shot: 3
63
+ metrics:
64
+ - type: acc
65
+ value: 66.83
66
+ name: accuracy
67
+ source:
68
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
69
+ name: Open Portuguese LLM Leaderboard
70
+ - task:
71
+ type: text-generation
72
+ name: Text Generation
73
+ dataset:
74
+ name: BLUEX (No Images)
75
+ type: eduagarcia-temp/BLUEX_without_images
76
+ split: train
77
+ args:
78
+ num_few_shot: 3
79
+ metrics:
80
+ - type: acc
81
+ value: 53.69
82
+ name: accuracy
83
+ source:
84
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
85
+ name: Open Portuguese LLM Leaderboard
86
+ - task:
87
+ type: text-generation
88
+ name: Text Generation
89
+ dataset:
90
+ name: OAB Exams
91
+ type: eduagarcia/oab_exams
92
+ split: train
93
+ args:
94
+ num_few_shot: 3
95
+ metrics:
96
+ - type: acc
97
+ value: 45.24
98
+ name: accuracy
99
+ source:
100
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
101
+ name: Open Portuguese LLM Leaderboard
102
+ - task:
103
+ type: text-generation
104
+ name: Text Generation
105
+ dataset:
106
+ name: Assin2 RTE
107
+ type: assin2
108
+ split: test
109
+ args:
110
+ num_few_shot: 15
111
+ metrics:
112
+ - type: f1_macro
113
+ value: 92.84
114
+ name: f1-macro
115
+ source:
116
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
117
+ name: Open Portuguese LLM Leaderboard
118
+ - task:
119
+ type: text-generation
120
+ name: Text Generation
121
+ dataset:
122
+ name: Assin2 STS
123
+ type: eduagarcia/portuguese_benchmark
124
+ split: test
125
+ args:
126
+ num_few_shot: 15
127
+ metrics:
128
+ - type: pearson
129
+ value: 75.92
130
+ name: pearson
131
+ source:
132
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
133
+ name: Open Portuguese LLM Leaderboard
134
+ - task:
135
+ type: text-generation
136
+ name: Text Generation
137
+ dataset:
138
+ name: FaQuAD NLI
139
+ type: ruanchaves/faquad-nli
140
+ split: test
141
+ args:
142
+ num_few_shot: 15
143
+ metrics:
144
+ - type: f1_macro
145
+ value: 79.67
146
+ name: f1-macro
147
+ source:
148
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
149
+ name: Open Portuguese LLM Leaderboard
150
+ - task:
151
+ type: text-generation
152
+ name: Text Generation
153
+ dataset:
154
+ name: HateBR Binary
155
+ type: ruanchaves/hatebr
156
+ split: test
157
+ args:
158
+ num_few_shot: 25
159
+ metrics:
160
+ - type: f1_macro
161
+ value: 88.04
162
+ name: f1-macro
163
+ source:
164
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
165
+ name: Open Portuguese LLM Leaderboard
166
+ - task:
167
+ type: text-generation
168
+ name: Text Generation
169
+ dataset:
170
+ name: PT Hate Speech Binary
171
+ type: hate_speech_portuguese
172
+ split: test
173
+ args:
174
+ num_few_shot: 25
175
+ metrics:
176
+ - type: f1_macro
177
+ value: 58.34
178
+ name: f1-macro
179
+ source:
180
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
181
+ name: Open Portuguese LLM Leaderboard
182
+ - task:
183
+ type: text-generation
184
+ name: Text Generation
185
+ dataset:
186
+ name: tweetSentBR
187
+ type: eduagarcia/tweetsentbr_fewshot
188
+ split: test
189
+ args:
190
+ num_few_shot: 25
191
+ metrics:
192
+ - type: f1_macro
193
+ value: 69.4
194
+ name: f1-macro
195
+ source:
196
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
197
+ name: Open Portuguese LLM Leaderboard
198
+ ---
199
+
200
+ # Model Card for Llama-3-8B-Dolphin-Portuguese
201
+
202
+ Model Trained on a translated version of dolphin dataset.
203
+
204
+
205
+ ## Usage
206
+ ```python
207
+ import transformers
208
+ import torch
209
+
210
+ model_id = "adalbertojunior/Llama-3-8B-Dolphin-Portuguese"
211
+
212
+ pipeline = transformers.pipeline(
213
+ "text-generation",
214
+ model=model_id,
215
+ model_kwargs={"torch_dtype": torch.bfloat16},
216
+ device_map="auto",
217
+ )
218
+
219
+ messages = [
220
+ {"role": "system", "content": "Você é um robô pirata que sempre responde como um pirata deveria!"},
221
+ {"role": "user", "content": "Quem é você?"},
222
+ ]
223
+
224
+ prompt = pipeline.tokenizer.apply_chat_template(
225
+ messages,
226
+ tokenize=False,
227
+ add_generation_prompt=True
228
+ )
229
+
230
+ terminators = [
231
+ pipeline.tokenizer.eos_token_id,
232
+ pipeline.tokenizer.convert_tokens_to_ids("<|eot_id|>")
233
+ ]
234
+
235
+ outputs = pipeline(
236
+ prompt,
237
+ max_new_tokens=256,
238
+ eos_token_id=terminators,
239
+ do_sample=True,
240
+ temperature=0.6,
241
+ top_p=0.9,
242
+ )
243
+ print(outputs[0]["generated_text"][len(prompt):])
244
+ ```
245
+
246
+ # Open Portuguese LLM Leaderboard Evaluation Results
247
+
248
+ Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/adalbertojunior/Llama-3-8B-Dolphin-Portuguese) and on the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
249
+
250
+ | Metric | Value |
251
+ |--------------------------|--------|
252
+ |Average |**70.0**|
253
+ |ENEM Challenge (No Images)| 66.83|
254
+ |BLUEX (No Images) | 53.69|
255
+ |OAB Exams | 45.24|
256
+ |Assin2 RTE | 92.84|
257
+ |Assin2 STS | 75.92|
258
+ |FaQuAD NLI | 79.67|
259
+ |HateBR Binary | 88.04|
260
+ |PT Hate Speech Binary | 58.34|
261
+ |tweetSentBR | 69.40|
262
+
263
+