leaderboard-pt-pr-bot commited on
Commit
93443e7
1 Parent(s): ac9a46b

Fixing some errors of the leaderboard evaluation results in the ModelCard yaml

Browse files

The name of a few benchmarks are incorrect on the model metadata. This commit fixes some minor errors of the [last PR](1) on the ModelCard YAML metadata.

Files changed (1) hide show
  1. README.md +27 -1
README.md CHANGED
@@ -94,6 +94,19 @@ model-index:
94
  - type: f1_macro
95
  value: 53.97
96
  name: f1-macro
 
 
 
 
 
 
 
 
 
 
 
 
 
97
  - type: pearson
98
  value: 0.24
99
  name: pearson
@@ -121,7 +134,7 @@ model-index:
121
  name: Text Generation
122
  dataset:
123
  name: HateBR Binary
124
- type: eduagarcia/portuguese_benchmark
125
  split: test
126
  args:
127
  num_few_shot: 25
@@ -129,6 +142,19 @@ model-index:
129
  - type: f1_macro
130
  value: 36.92
131
  name: f1-macro
 
 
 
 
 
 
 
 
 
 
 
 
 
132
  - type: f1_macro
133
  value: 42.63
134
  name: f1-macro
 
94
  - type: f1_macro
95
  value: 53.97
96
  name: f1-macro
97
+ source:
98
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-160m
99
+ name: Open Portuguese LLM Leaderboard
100
+ - task:
101
+ type: text-generation
102
+ name: Text Generation
103
+ dataset:
104
+ name: Assin2 STS
105
+ type: eduagarcia/portuguese_benchmark
106
+ split: test
107
+ args:
108
+ num_few_shot: 15
109
+ metrics:
110
  - type: pearson
111
  value: 0.24
112
  name: pearson
 
134
  name: Text Generation
135
  dataset:
136
  name: HateBR Binary
137
+ type: ruanchaves/hatebr
138
  split: test
139
  args:
140
  num_few_shot: 25
 
142
  - type: f1_macro
143
  value: 36.92
144
  name: f1-macro
145
+ source:
146
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-160m
147
+ name: Open Portuguese LLM Leaderboard
148
+ - task:
149
+ type: text-generation
150
+ name: Text Generation
151
+ dataset:
152
+ name: PT Hate Speech Binary
153
+ type: hate_speech_portuguese
154
+ split: test
155
+ args:
156
+ num_few_shot: 25
157
+ metrics:
158
  - type: f1_macro
159
  value: 42.63
160
  name: f1-macro