Commit 6cf615f by nicholasKluge (1 parent: 1474a13)

Update README.md

README.md CHANGED
@@ -18,9 +18,16 @@ widget:
 ---
 # TeenyTinyLlama-162m-HateBR
 
-TeenyTinyLlama is a series of small foundational models trained in Portuguese.
+TeenyTinyLlama is a series of small foundational models trained in Brazilian Portuguese.
 
-This repository contains a version of [TeenyTinyLlama-162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m) fine-tuned on
+This repository contains a version of [TeenyTinyLlama-162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m) (`TeenyTinyLlama-162m-HateBR`) fine-tuned on the [HateBR dataset](https://huggingface.co/datasets/ruanchaves/hatebr).
+
+## Details
+
+- **Number of Epochs:** 3
+- **Batch size:** 16
+- **Optimizer:** `torch.optim.AdamW` (learning_rate = 4e-5, epsilon = 1e-8)
+- **GPU:** 1 NVIDIA A100-SXM4-40GB
 
 ## Reproducing
 
@@ -117,7 +124,7 @@ trainer.train()
 
 ```
 
-##
+## Fine-Tuning Comparisons
 
 | Models | [HateBr](https://huggingface.co/datasets/ruanchaves/hatebr) |
 |--------------------------------------------------------------------------------------------|-------------------------------------------------------------|
@@ -125,3 +132,26 @@ trainer.train()
 | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 91.28 |
 | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese) | 87.42 |
 
+## Cite as 🤗
+
+```latex
+
+@misc{nicholas22llama,
+doi = {10.5281/zenodo.6989727},
+url = {https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m},
+author = {Nicholas Kluge Corrêa},
+title = {TeenyTinyLlama},
+year = {2023},
+publisher = {HuggingFace},
+journal = {HuggingFace repository},
+}
+
+```
+
+## Funding
+
+This repository was built as part of the RAIES ([Rede de Inteligência Artificial Ética e Segura](https://www.raies.org/)) initiative, a project supported by FAPERGS - ([Fundação de Amparo à Pesquisa do Estado do Rio Grande do Sul](https://fapergs.rs.gov.br/inicial)), Brazil.
+
+## License
+
+The TeenyTinyLlama-162m-HateBR is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.