nicholasKluge committed • cc44bcc (parent: 0f2cb9f)
Update README.md

README.md (changed)
Large language models (LLMs) have significantly advanced natural language processing, but this progress has not been equal across languages. While most LLMs are trained in high-resource languages like English, multilingual models generally underperform monolingual ones. Additionally, aspects of their multilingual foundations, like computational demands and licensing regimes, sometimes restrict the byproducts they produce. Hence, we developed the _TeenyTinyLlama_ pair: two compact models for Brazilian Portuguese text generation.

Read our preprint on [ArXiv](xxx).

## Details

- **Architecture:** a Transformer-based model pre-trained via causal language modeling
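Since the model is a causal LM in the Hugging Face ecosystem, text generation follows the standard `transformers` workflow. A minimal sketch, assuming the checkpoint id `nicholasKluge/TeenyTinyLlama-160m` (an assumption based on the author's namespace; substitute the checkpoint you actually use):

```python
# Minimal sketch: greedy text generation with a TeenyTinyLlama checkpoint.
# The repo id below is an assumption; adjust it to the checkpoint you use.
CHECKPOINT = "nicholasKluge/TeenyTinyLlama-160m"  # assumed repo id

def generate(prompt: str, max_new_tokens: int = 50) -> str:
    """Generate a continuation of `prompt` with a causal language model."""
    # Lazy import so the module loads even without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(CHECKPOINT)
    model = AutoModelForCausalLM.from_pretrained(CHECKPOINT)
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

# Example usage (requires network access and `transformers` installed):
#   print(generate("A capital do Brasil é"))
```

The model weights are downloaded on first use, so the call is kept inside a function rather than at import time.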
[...]

```latex
@misc{correa24ttllama,
  title = {TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese},
  author = {Corr{\^e}a, Nicholas Kluge and Falk, Sophia and Fatimah, Shiza and Sen, Aniket and De Oliveira, Nythamar},
  journal = {arXiv},
  year = {2024},
}
```