adalbertojunior committed on
Commit
946cabd
1 Parent(s): 0119f90

Update README.md

  1. README.md +166 -1
README.md CHANGED
@@ -1,9 +1,157 @@
  ---
  library_name: transformers
  datasets:
  - adalbertojunior/dolphin_pt_test
  language:
  - pt
  ---

  # Model Card for Llama-3-8B-Dolphin-Portuguese
@@ -50,4 +198,21 @@ outputs = pipeline(
  top_p=0.9,
  )
  print(outputs[0]["generated_text"][len(prompt):])
- ```
+
  ---
  library_name: transformers
  datasets:
  - adalbertojunior/dolphin_pt_test
  language:
  - pt
+ model-index:
+ - name: Llama-3-8B-Dolphin-Portuguese
+   results:
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: ENEM Challenge (No Images)
+       type: eduagarcia/enem_challenge
+       split: train
+       args:
+         num_few_shot: 3
+     metrics:
+     - type: acc
+       value: 66.83
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: BLUEX (No Images)
+       type: eduagarcia-temp/BLUEX_without_images
+       split: train
+       args:
+         num_few_shot: 3
+     metrics:
+     - type: acc
+       value: 53.69
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: OAB Exams
+       type: eduagarcia/oab_exams
+       split: train
+       args:
+         num_few_shot: 3
+     metrics:
+     - type: acc
+       value: 45.24
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: Assin2 RTE
+       type: assin2
+       split: test
+       args:
+         num_few_shot: 15
+     metrics:
+     - type: f1_macro
+       value: 92.84
+       name: f1-macro
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: Assin2 STS
+       type: eduagarcia/portuguese_benchmark
+       split: test
+       args:
+         num_few_shot: 15
+     metrics:
+     - type: pearson
+       value: 75.92
+       name: pearson
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: FaQuAD NLI
+       type: ruanchaves/faquad-nli
+       split: test
+       args:
+         num_few_shot: 15
+     metrics:
+     - type: f1_macro
+       value: 79.67
+       name: f1-macro
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: HateBR Binary
+       type: ruanchaves/hatebr
+       split: test
+       args:
+         num_few_shot: 25
+     metrics:
+     - type: f1_macro
+       value: 88.04
+       name: f1-macro
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: PT Hate Speech Binary
+       type: hate_speech_portuguese
+       split: test
+       args:
+         num_few_shot: 25
+     metrics:
+     - type: f1_macro
+       value: 58.34
+       name: f1-macro
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: tweetSentBR
+       type: eduagarcia/tweetsentbr_fewshot
+       split: test
+       args:
+         num_few_shot: 25
+     metrics:
+     - type: f1_macro
+       value: 69.4
+       name: f1-macro
+     source:
+       url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=adalbertojunior/Llama-3-8B-Dolphin-Portuguese
+       name: Open Portuguese LLM Leaderboard
  ---

  # Model Card for Llama-3-8B-Dolphin-Portuguese
 
  top_p=0.9,
  )
  print(outputs[0]["generated_text"][len(prompt):])
+ ```
+
+ # Open Portuguese LLM Leaderboard Evaluation Results
+
+ Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/adalbertojunior/Llama-3-8B-Dolphin-Portuguese) and on the [🚀 Open Portuguese LLM Leaderboard](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
+
+ |          Metric          | Value  |
+ |--------------------------|--------|
+ |Average                   |**70.0**|
+ |ENEM Challenge (No Images)|   66.83|
+ |BLUEX (No Images)         |   53.69|
+ |OAB Exams                 |   45.24|
+ |Assin2 RTE                |   92.84|
+ |Assin2 STS                |   75.92|
+ |FaQuAD NLI                |   79.67|
+ |HateBR Binary             |   88.04|
+ |PT Hate Speech Binary     |   58.34|
+ |tweetSentBR               |   69.40|
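
A quick arithmetic check on the results table added in this commit: the Average row is consistent with the unweighted mean of the nine task scores. That the leaderboard computes its average this way is an assumption, not something the commit states, and the `scores` dict and `unweighted_mean` helper below are illustrative names rather than part of the README.

```python
# Scores copied from the results table added in this commit.
scores = {
    "ENEM Challenge (No Images)": 66.83,
    "BLUEX (No Images)": 53.69,
    "OAB Exams": 45.24,
    "Assin2 RTE": 92.84,
    "Assin2 STS": 75.92,
    "FaQuAD NLI": 79.67,
    "HateBR Binary": 88.04,
    "PT Hate Speech Binary": 58.34,
    "tweetSentBR": 69.40,
}


def unweighted_mean(values):
    """Plain arithmetic mean (assumed aggregation, not confirmed by the commit)."""
    values = list(values)
    return sum(values) / len(values)


average = unweighted_mean(scores.values())

# Rounded to one decimal place, this matches the table's Average row (70.0).
print(f"Average over {len(scores)} tasks: {average:.1f}")
```

If the leaderboard weights tasks differently or normalizes scores before averaging, the check would need to be adjusted accordingly.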