nicholasKluge commited on
Commit
6cf615f
·
1 Parent(s): 1474a13

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -3
README.md CHANGED
@@ -18,9 +18,16 @@ widget:
18
  ---
19
  # TeenyTinyLlama-162m-HateBR
20
 
21
- TeenyTinyLlama is a series of small foundational models trained in Portuguese.
22
 
23
- This repository contains a version of [TeenyTinyLlama-162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m) fine-tuned on a translated version of the [HateBR dataset](https://huggingface.co/datasets/ruanchaves/hatebr).
 
 
 
 
 
 
 
24
 
25
  ## Reproducing
26
 
@@ -117,7 +124,7 @@ trainer.train()
117
 
118
  ```
119
 
120
- ## Results
121
 
122
  | Models | [HateBr](https://huggingface.co/datasets/ruanchaves/hatebr) |
123
  |--------------------------------------------------------------------------------------------|-------------------------------------------------------------|
@@ -125,3 +132,26 @@ trainer.train()
125
  | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 91.28 |
126
  | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese) | 87.42 |
127
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
18
  ---
19
  # TeenyTinyLlama-162m-HateBR
20
 
21
+ TeenyTinyLlama is a series of small foundational models trained in Brazilian Portuguese.
22
 
23
+ This repository contains a version of [TeenyTinyLlama-162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m) (`TeenyTinyLlama-162m-HateBR`) fine-tuned on the [HateBR dataset](https://huggingface.co/datasets/ruanchaves/hatebr).
24
+
25
+ ## Details
26
+
27
+ - **Number of Epochs:** 3
28
+ - **Batch size:** 16
29
+ - **Optimizer:** `torch.optim.AdamW` (learning_rate = 4e-5, epsilon = 1e-8)
30
+ - **GPU:** 1 NVIDIA A100-SXM4-40GB
31
 
32
  ## Reproducing
33
 
 
124
 
125
  ```
126
 
127
+ ## Fine-Tuning Comparisons
128
 
129
  | Models | [HateBr](https://huggingface.co/datasets/ruanchaves/hatebr) |
130
  |--------------------------------------------------------------------------------------------|-------------------------------------------------------------|
 
132
  | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 91.28 |
133
  | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese) | 87.42 |
134
 
135
+ ## Cite as 🤗
136
+
137
+ ```latex
138
+
139
+ @misc{nicholas22llama,
140
+ doi = {10.5281/zenodo.6989727},
141
+ url = {https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m},
142
+ author = {Nicholas Kluge Corrêa},
143
+ title = {TeenyTinyLlama},
144
+ year = {2023},
145
+ publisher = {HuggingFace},
146
+ journal = {HuggingFace repository},
147
+ }
148
+
149
+ ```
150
+
151
+ ## Funding
152
+
153
+ This repository was built as part of the RAIES ([Rede de Inteligência Artificial Ética e Segura](https://www.raies.org/)) initiative, a project supported by FAPERGS - ([Fundação de Amparo à Pesquisa do Estado do Rio Grande do Sul](https://fapergs.rs.gov.br/inicial)), Brazil.
154
+
155
+ ## License
156
+
157
+ The TeenyTinyLlama-162m-HateBR is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.