jarodrigues
commited on
Commit
•
5c5bc0f
1
Parent(s):
55e6906
Update README.md
Browse files
README.md
CHANGED
@@ -42,15 +42,16 @@ It has different versions that were trained for different variants of Portuguese
|
|
42 |
namely the European variant from Portugal (**PT-PT**) and the American variant from Brazil (**PT-BR**),
|
43 |
and it is distributed free of charge and under a most permissible license.
|
44 |
|
45 |
-
**Albertina PT-BR** is the version for American **Portuguese** from **Brazil**,
|
46 |
-
|
47 |
-
|
|
|
48 |
that sets a new state of the art for it, and is made publicly available
|
49 |
and distributed for reuse.
|
50 |
|
51 |
|
52 |
|
53 |
-
|
54 |
For further details, check the respective [publication](https://arxiv.org/abs/2305.06721):
|
55 |
|
56 |
``` latex
|
|
|
42 |
namely the European variant from Portugal (**PT-PT**) and the American variant from Brazil (**PT-BR**),
|
43 |
and it is distributed free of charge and under a most permissible license.
|
44 |
|
45 |
+
**Albertina PT-BR** is the version for American **Portuguese** from **Brazil**, trained on the brWaC data set.
|
46 |
+
|
47 |
+
You may be interested also in [**Albertina PT-BR No-brWaC**](https://huggingface.co/PORTULAN/albertina-ptbr-nobrwac), trained on data sets other than brWaC and thus with a more permissive license.
|
48 |
+
To the best of our knowledge, these are encoders specifically for this language and variant
|
49 |
that sets a new state of the art for it, and is made publicly available
|
50 |
and distributed for reuse.
|
51 |
|
52 |
|
53 |
|
54 |
+
**Albertina PT-BR** is developed by a joint team from the University of Lisbon and the University of Porto, Portugal.
|
55 |
For further details, check the respective [publication](https://arxiv.org/abs/2305.06721):
|
56 |
|
57 |
``` latex
|