Commit
3dc2086
1 Parent(s): 612f3e5

Fixing some errors of the leaderboard evaluation results in the ModelCard yaml (#2)

- Fixing some errors of the leaderboard evaluation results in the ModelCard yaml (4afb567534e4698ebccdcf6b3e3b185437eb00b5)


Co-authored-by: Open PT LLM Leaderboard PR Bot <leaderboard-pt-pr-bot@users.noreply.huggingface.co>

Files changed (1)
  1. README.md +27 -1
README.md CHANGED
@@ -78,6 +78,19 @@ model-index:
     - type: f1_macro
       value: 91.42
       name: f1-macro
+    source:
+      url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicolasdec/CabraQwen14b
+      name: Open Portuguese LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: Assin2 STS
+      type: eduagarcia/portuguese_benchmark
+      split: test
+      args:
+        num_few_shot: 15
+    metrics:
     - type: pearson
       value: 80.85
       name: pearson
@@ -105,7 +118,7 @@ model-index:
       name: Text Generation
     dataset:
       name: HateBR Binary
-      type: eduagarcia/portuguese_benchmark
+      type: ruanchaves/hatebr
       split: test
       args:
         num_few_shot: 25
@@ -113,6 +126,19 @@ model-index:
     - type: f1_macro
       value: 79.32
       name: f1-macro
+    source:
+      url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicolasdec/CabraQwen14b
+      name: Open Portuguese LLM Leaderboard
+  - task:
+      type: text-generation
+      name: Text Generation
+    dataset:
+      name: PT Hate Speech Binary
+      type: hate_speech_portuguese
+      split: test
+      args:
+        num_few_shot: 25
+    metrics:
     - type: f1_macro
       value: 71.8
       name: f1-macro
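
The diff suggests the error being fixed: two metrics from different datasets sat under a single result entry because the `source` and `- task:` lines separating them were missing. A minimal sketch of a check for that pattern follows. The function name `check_results`, the required-key list, and the one-metric-per-entry rule are assumptions made for illustration, not part of the leaderboard bot. The entries are plain dicts standing in for the parsed YAML, and the `source` key in `before` is assumed, since the diff does not show the lines after the last metric.

```python
from __future__ import annotations

from typing import Any

# Hypothetical rule set: keys each model-index result entry is expected to carry.
REQUIRED_KEYS = ("task", "dataset", "metrics", "source")


def check_results(results: list[dict[str, Any]]) -> list[str]:
    """Return human-readable problems found in model-index result entries."""
    problems = []
    for i, entry in enumerate(results):
        name = entry.get("dataset", {}).get("name", f"entry {i}")
        for key in REQUIRED_KEYS:
            if key not in entry:
                problems.append(f"{name}: missing '{key}'")
        # Assumption: one metric per dataset entry, as in the fixed README.
        metrics = entry.get("metrics", [])
        if len(metrics) > 1:
            problems.append(f"{name}: {len(metrics)} metrics in one entry")
    return problems


TASK = {"type": "text-generation", "name": "Text Generation"}
SOURCE = {
    "url": "https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard"
           "?query=nicolasdec/CabraQwen14b",
    "name": "Open Portuguese LLM Leaderboard",
}

# Assumed shape before the fix: both f1_macro values merged under HateBR Binary.
before = [{
    "task": TASK,
    "dataset": {"name": "HateBR Binary", "type": "eduagarcia/portuguese_benchmark"},
    "metrics": [
        {"type": "f1_macro", "value": 79.32, "name": "f1-macro"},
        {"type": "f1_macro", "value": 71.8, "name": "f1-macro"},
    ],
    "source": SOURCE,
}]

# Shape after the fix: one entry per dataset, each with its own source.
after = [
    {
        "task": TASK,
        "dataset": {"name": "HateBR Binary", "type": "ruanchaves/hatebr"},
        "metrics": [{"type": "f1_macro", "value": 79.32, "name": "f1-macro"}],
        "source": SOURCE,
    },
    {
        "task": TASK,
        "dataset": {"name": "PT Hate Speech Binary", "type": "hate_speech_portuguese"},
        "metrics": [{"type": "f1_macro", "value": 71.8, "name": "f1-macro"}],
        "source": SOURCE,
    },
]

print(check_results(before))
print(check_results(after))
```

The check flags only structure. It would not catch the wrong dataset `type` that the second hunk corrects, which needs a lookup against the expected dataset identifiers.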