catherinearnett committed on
Commit e7e8bba (1 parent: 6f61b9e)
Upload README.md with huggingface_hub
README.md CHANGED
@@ -12,7 +12,7 @@ library_name: transformers
 
 # B-GPT_nl_en_sequential
 
-This is a bilingual GPT-2 style model. For the first half of training, this model was trained only on Dutch data. In the second half of training, the model was trained on only
+This is a bilingual GPT-2 style model. For the first half of training, this model was trained only on Dutch data. In the second half of training, the model was trained on only English data. At the end of training, 50 % of training data seen by the model is Dutch and 50 % is English. The tokenizer was trained on the same overall proportions of data as the language model at the final step.
 
 ## Model details:
 
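
The sequential schedule described in the updated README line can be illustrated with a minimal sketch. This is not the authors' training code. It assumes every training step consumes the same amount of data, and the names `language_at_step`, `fraction_seen`, and the step count of 1000 are hypothetical, chosen only for illustration.

```python
from collections import Counter


def language_at_step(step: int, total_steps: int) -> str:
    """Return the training language for a 0-indexed step.

    Assumption (not from the README): steps are equal-sized, so the
    first half of the steps is Dutch ("nl") and the second half is
    English ("en").
    """
    if not 0 <= step < total_steps:
        raise ValueError("step must be in [0, total_steps)")
    return "nl" if step < total_steps / 2 else "en"


def fraction_seen(total_steps: int) -> dict:
    """Fraction of all training steps spent on each language."""
    counts = Counter(language_at_step(s, total_steps) for s in range(total_steps))
    return {lang: n / total_steps for lang, n in counts.items()}


if __name__ == "__main__":
    total = 1000  # hypothetical step count, for illustration only
    print(language_at_step(0, total))
    print(language_at_step(total - 1, total))
    print(fraction_seen(total))
```

Under these assumptions, the language switches exactly once at the midpoint, and the cumulative split at the final step is half Dutch and half English, which matches the proportions the README states for both the model and the tokenizer.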