BR_BERTo

julien-c commited on
Commit b82a727
1 Parent(s): 5e09b6b

Migrate model card from transformers-repo

Browse files

Read announcement at https://discuss.huggingface.co/t/announcement-all-model-cards-will-be-migrated-to-hf-co-model-repos/2755
Original file history: https://github.com/huggingface/transformers/commits/master/model_cards/rdenadai/BR_BERTo/README.md

Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: pt
3
+ tags:
4
+ - portuguese
5
+ - brazil
6
+ - pt_BR
7
+ widget:
8
+ - text: gostei muito dessa <mask>
9
+ ---
10
+
11
+ # BR_BERTo
12
+
13
+ Portuguese (Brazil) model for text inference.
14
+
15
+ ## Params
16
+
17
+ Trained on a corpus of 6_993_330 sentences.
18
+
19
+ - Vocab size: 150_000
20
+ - RobertaForMaskedLM size : 512
21
+ - Num train epochs: 3
22
+ - Time to train: ~10days (on GCP with a Nvidia T4)
23
+
24
+ I follow the great tutorial from HuggingFace team:
25
+
26
+ [How to train a new language model from scratch using Transformers and Tokenizers](https://huggingface.co/blog/how-to-train)
27
+
28
+ More infor here:
29
+
30
+ [BR_BERTo](https://github.com/rdenadai/BR-BERTo)