jarodrigues committed on
Commit
5c5bc0f
1 Parent(s): 55e6906

Update README.md

Files changed (1)
  1. README.md +5 -4
README.md CHANGED
@@ -42,15 +42,16 @@ It has different versions that were trained for different variants of Portuguese
  namely the European variant from Portugal (**PT-PT**) and the American variant from Brazil (**PT-BR**),
  and it is distributed free of charge and under a most permissible license.
 
- **Albertina PT-BR** is the version for American **Portuguese** from **Brazil**,
- and to the best of our knowledge, at the time of its initial distribution,
- it is an encoder specifically for this language and variant
+ **Albertina PT-BR** is the version for American **Portuguese** from **Brazil**, trained on the brWaC data set.
+
+ You may be interested also in [**Albertina PT-BR No-brWaC**](https://huggingface.co/PORTULAN/albertina-ptbr-nobrwac), trained on data sets other than brWaC and thus with a more permissive license.
+ To the best of our knowledge, these are encoders specifically for this language and variant
  that sets a new state of the art for it, and is made publicly available
  and distributed for reuse.
 
 
 
- It is developed by a joint team from the University of Lisbon and the University of Porto, Portugal.
+ **Albertina PT-BR** is developed by a joint team from the University of Lisbon and the University of Porto, Portugal.
  For further details, check the respective [publication](https://arxiv.org/abs/2305.06721):
 
  ``` latex