Update README.md
README.md
CHANGED
@@ -13,10 +13,11 @@ This model card will contain more information *soon*. Please reach out to Alexan
 TBD
 
 ## Training data
-
+2.5 billion tweets with 56 billion subwords in 66 languages (as identified in Twitter metadata).
+The tweets are collected from the 1% public Twitter stream between January 2016 and December 2021.
 
 ## Training procedure
-
+RoBERTa pre-training with BERT-base architecture.
 
 ## Evaluation results
 TBD