leaderboard-pt-pr-bot commited on
Commit
a06ba8f
1 Parent(s): 7022940

Adding the Open Portuguese LLM Leaderboard Evaluation Results

Browse files

This is an automated PR created with https://huggingface.co/spaces/eduagarcia-temp/portuguese-leaderboard-results-to-modelcard

The purpose of this PR is to add evaluation results from the Open Portuguese LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/eduagarcia-temp/portuguese-leaderboard-results-to-modelcard/discussions

Files changed (1) hide show
  1. README.md +146 -9
README.md CHANGED
@@ -1,21 +1,21 @@
1
  ---
 
 
2
  license: apache-2.0
 
 
 
3
  datasets:
4
  - nicholasKluge/Pt-Corpus-Instruct
5
- language:
6
- - pt
7
  metrics:
8
  - perplexity
9
- library_name: transformers
10
  pipeline_tag: text-generation
11
- tags:
12
- - text-generation-inference
13
  widget:
14
- - text: "A PUCRS é uma universidade "
15
  example_title: Exemplo
16
- - text: "A muitos anos atrás, em uma galáxia muito distante, vivia uma raça de"
17
  example_title: Exemplo
18
- - text: "Em meio a um escândalo, a frente parlamentar pediu ao Senador Silva para"
19
  example_title: Exemplo
20
  inference:
21
  parameters:
@@ -30,6 +30,127 @@ co2_eq_emissions:
30
  training_type: pre-training
31
  geographical_location: Germany
32
  hardware_used: NVIDIA A100-SXM4-40GB
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  ---
34
  # TeenyTinyLlama-460m
35
 
@@ -224,4 +345,20 @@ This repository was built as part of the RAIES ([Rede de Inteligência Artificia
224
 
225
  ## License
226
 
227
- TeenyTinyLlama-460m is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ language:
3
+ - pt
4
  license: apache-2.0
5
+ library_name: transformers
6
+ tags:
7
+ - text-generation-inference
8
  datasets:
9
  - nicholasKluge/Pt-Corpus-Instruct
 
 
10
  metrics:
11
  - perplexity
 
12
  pipeline_tag: text-generation
 
 
13
  widget:
14
+ - text: 'A PUCRS é uma universidade '
15
  example_title: Exemplo
16
+ - text: A muitos anos atrás, em uma galáxia muito distante, vivia uma raça de
17
  example_title: Exemplo
18
+ - text: Em meio a um escândalo, a frente parlamentar pediu ao Senador Silva para
19
  example_title: Exemplo
20
  inference:
21
  parameters:
 
30
  training_type: pre-training
31
  geographical_location: Germany
32
  hardware_used: NVIDIA A100-SXM4-40GB
33
+ model-index:
34
+ - name: TeenyTinyLlama-460m
35
+ results:
36
+ - task:
37
+ type: text-generation
38
+ name: Text Generation
39
+ dataset:
40
+ name: ENEM Challenge (No Images)
41
+ type: eduagarcia/enem_challenge
42
+ split: train
43
+ args:
44
+ num_few_shot: 3
45
+ metrics:
46
+ - type: acc
47
+ value: 20.15
48
+ name: accuracy
49
+ source:
50
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-460m
51
+ name: Open Portuguese LLM Leaderboard
52
+ - task:
53
+ type: text-generation
54
+ name: Text Generation
55
+ dataset:
56
+ name: BLUEX (No Images)
57
+ type: eduagarcia-temp/BLUEX_without_images
58
+ split: train
59
+ args:
60
+ num_few_shot: 3
61
+ metrics:
62
+ - type: acc
63
+ value: 25.73
64
+ name: accuracy
65
+ source:
66
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-460m
67
+ name: Open Portuguese LLM Leaderboard
68
+ - task:
69
+ type: text-generation
70
+ name: Text Generation
71
+ dataset:
72
+ name: OAB Exams
73
+ type: eduagarcia/oab_exams
74
+ split: train
75
+ args:
76
+ num_few_shot: 3
77
+ metrics:
78
+ - type: acc
79
+ value: 27.02
80
+ name: accuracy
81
+ source:
82
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-460m
83
+ name: Open Portuguese LLM Leaderboard
84
+ - task:
85
+ type: text-generation
86
+ name: Text Generation
87
+ dataset:
88
+ name: Assin2 RTE
89
+ type: assin2
90
+ split: test
91
+ args:
92
+ num_few_shot: 15
93
+ metrics:
94
+ - type: f1_macro
95
+ value: 53.61
96
+ name: f1-macro
97
+ - type: pearson
98
+ value: 13.0
99
+ name: pearson
100
+ source:
101
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-460m
102
+ name: Open Portuguese LLM Leaderboard
103
+ - task:
104
+ type: text-generation
105
+ name: Text Generation
106
+ dataset:
107
+ name: FaQuAD NLI
108
+ type: ruanchaves/faquad-nli
109
+ split: test
110
+ args:
111
+ num_few_shot: 15
112
+ metrics:
113
+ - type: f1_macro
114
+ value: 46.41
115
+ name: f1-macro
116
+ source:
117
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-460m
118
+ name: Open Portuguese LLM Leaderboard
119
+ - task:
120
+ type: text-generation
121
+ name: Text Generation
122
+ dataset:
123
+ name: HateBR Binary
124
+ type: eduagarcia/portuguese_benchmark
125
+ split: test
126
+ args:
127
+ num_few_shot: 25
128
+ metrics:
129
+ - type: f1_macro
130
+ value: 33.59
131
+ name: f1-macro
132
+ - type: f1_macro
133
+ value: 22.99
134
+ name: f1-macro
135
+ source:
136
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-460m
137
+ name: Open Portuguese LLM Leaderboard
138
+ - task:
139
+ type: text-generation
140
+ name: Text Generation
141
+ dataset:
142
+ name: tweetSentBR
143
+ type: eduagarcia-temp/tweetsentbr
144
+ split: test
145
+ args:
146
+ num_few_shot: 25
147
+ metrics:
148
+ - type: f1_macro
149
+ value: 17.28
150
+ name: f1-macro
151
+ source:
152
+ url: https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard?query=nicholasKluge/TeenyTinyLlama-460m
153
+ name: Open Portuguese LLM Leaderboard
154
  ---
155
  # TeenyTinyLlama-460m
156
 
 
345
 
346
  ## License
347
 
348
+ TeenyTinyLlama-460m is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.
349
+ # [Open Portuguese LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/eduagarcia/open_pt_llm_leaderboard)
350
+ Detailed results can be found [here](https://huggingface.co/datasets/eduagarcia-temp/llm_pt_leaderboard_raw_results/tree/main/nicholasKluge/TeenyTinyLlama-460m)
351
+
352
+ | Metric | Value |
353
+ |--------------------------|---------|
354
+ |Average |**28.86**|
355
+ |ENEM Challenge (No Images)| 20.15|
356
+ |BLUEX (No Images) | 25.73|
357
+ |OAB Exams | 27.02|
358
+ |Assin2 RTE | 53.61|
359
+ |Assin2 STS | 13|
360
+ |FaQuAD NLI | 46.41|
361
+ |HateBR Binary | 33.59|
362
+ |PT Hate Speech Binary | 22.99|
363
+ |tweetSentBR | 17.28|
364
+