celiolarcher committed
Commit fc2a2de (1 parent: e2ed3c7)

Update README.md

Files changed (1)
  1. README.md +19 -0
README.md CHANGED
@@ -1,3 +1,22 @@
  ---
  license: apache-2.0
+ language:
+ - pt
+ - en
  ---
+ Cabrita is a collection of models adapted to the Portuguese language through tokenizer adaptation and continued pre-training.
+ This artifact is the 3-billion-parameter variant.
+
+ The weights were initialized from the open_llama_3b checkpoint of the OpenLLaMA project
+ (https://github.com/openlm-research/open_llama).
+
+ ```
+ @misc{larcher2023cabrita,
+ title={Cabrita: closing the gap for foreign languages},
+ author={Celio Larcher and Marcos Piau and Paulo Finardi and Pedro Gengo and Piero Esposito and Vinicius Caridá},
+ year={2023},
+ eprint={2308.11878},
+ archivePrefix={arXiv},
+ primaryClass={cs.CL}
+ }
+ ```
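
The front matter this commit extends (the `license` and new `language` keys between the `---` delimiters) is machine-readable metadata. As a minimal sketch of how a tool might read it, the snippet below splits a README into metadata and body using only the standard library. The `parse_front_matter` helper and the shortened `README` string are hypothetical, written for illustration; the parser handles only flat `key: value` pairs and `- item` lists, so a full YAML parser would be needed for anything richer.

```python
def parse_front_matter(text):
    """Split a README into (metadata, body).

    Hypothetical helper: supports only flat `key: value` pairs and
    `- item` lists, which is all the front matter in this commit uses.
    """
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text  # no front matter present

    metadata = {}
    current_key = None  # key whose list items are currently being collected
    for index, line in enumerate(lines[1:], start=1):
        stripped = line.strip()
        if stripped == "---":
            # Closing delimiter: everything after it is the card body.
            return metadata, "\n".join(lines[index + 1:])
        if stripped.startswith("- ") and current_key is not None:
            metadata[current_key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key, value = key.strip(), value.strip()
            if value:
                metadata[key] = value
                current_key = None
            else:
                # An empty value means a list follows on the next lines.
                metadata[key] = []
                current_key = key
    return {}, text  # unterminated front matter: treat as plain text


# Shortened stand-in for the README after this commit.
README = """---
license: apache-2.0
language:
- pt
- en
---
The Cabrita model ...
"""

metadata, body = parse_front_matter(README)
print(metadata["language"])  # ['pt', 'en']
```

Declaring both `pt` and `en` as a list, rather than a single string, lets a consumer of the metadata match the model under either language.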