nicholasKluge
/

TeenyTinyLlama-160m-HateBR

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

nicholasKluge commited on Dec 27, 2023

Commit

6cf615f

·

1 Parent(s): 1474a13

Update README.md

Files changed (1) hide show

README.md +33 -3

README.md CHANGED Viewed

@@ -18,9 +18,16 @@ widget:
 ---
 # TeenyTinyLlama-162m-HateBR
-TeenyTinyLlama is a series of small foundational models trained in Portuguese.
-This repository contains a version of [TeenyTinyLlama-162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m) fine-tuned on a translated version of the [HateBR dataset](https://huggingface.co/datasets/ruanchaves/hatebr).
 ## Reproducing
@@ -117,7 +124,7 @@ trainer.train()
 ```
-## Results
 | Models                                                                                     | [HateBr](https://huggingface.co/datasets/ruanchaves/hatebr) |
 |--------------------------------------------------------------------------------------------|-------------------------------------------------------------|
@@ -125,3 +132,26 @@ trainer.train()
 | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 91.28                                                       |
 | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese)        | 87.42                                                       |

 ---
 # TeenyTinyLlama-162m-HateBR
+TeenyTinyLlama is a series of small foundational models trained in Brazilian Portuguese.
+This repository contains a version of [TeenyTinyLlama-162m](https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m) (`TeenyTinyLlama-162m-HateBR`) fine-tuned on the [HateBR dataset](https://huggingface.co/datasets/ruanchaves/hatebr).
+## Details
+- **Number of Epochs:** 3
+- **Batch size:** 16
+- **Optimizer:** `torch.optim.AdamW` (learning_rate = 4e-5, epsilon = 1e-8)
+- **GPU:** 1 NVIDIA A100-SXM4-40GB
 ## Reproducing
 ```
+## Fine-Tuning Comparisons
 | Models                                                                                     | [HateBr](https://huggingface.co/datasets/ruanchaves/hatebr) |
 |--------------------------------------------------------------------------------------------|-------------------------------------------------------------|
 | [Bert-base-portuguese-cased](https://huggingface.co/neuralmind/bert-base-portuguese-cased) | 91.28                                                       |
 | [Gpt2-small-portuguese](https://huggingface.co/pierreguillou/gpt2-small-portuguese)        | 87.42                                                       |
+## Cite as 🤗
+```latex
+@misc{nicholas22llama,
+  doi = {10.5281/zenodo.6989727},
+  url = {https://huggingface.co/nicholasKluge/TeenyTinyLlama-162m},
+  author = {Nicholas Kluge Corrêa},
+  title = {TeenyTinyLlama},
+  year = {2023},
+  publisher = {HuggingFace},
+  journal = {HuggingFace repository},
+}
+```
+## Funding
+This repository was built as part of the RAIES ([Rede de Inteligência Artificial Ética e Segura](https://www.raies.org/)) initiative, a project supported by FAPERGS - ([Fundação de Amparo à Pesquisa do Estado do Rio Grande do Sul](https://fapergs.rs.gov.br/inicial)), Brazil.
+## License
+The TeenyTinyLlama-162m-HateBR is licensed under the Apache License, Version 2.0. See the [LICENSE](LICENSE) file for more details.