omarmomen commited on
Commit
e1623e8
1 Parent(s): 623349b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -13,6 +13,7 @@ pipeline_tag: fill-mask
13
 
14
  This model is part of the experiments in my master's thesis titled "Linguistic Structure Induction from Language Models" (https://arxiv.org/abs/2403.09714).
15
 
16
- "omarmomen/tf_babylm_1" is the Vanilla Transformer (TF) referred to in Chapter 5 (p. 59); it is a transformer encoder model.
 
 
17
 
18
- The model is trained on the BabyLM 10M dataset, with a RobertaTokenizer pretrained on the BabyLM 10M dataset with 16K tokens.
 
13
 
14
  This model is part of the experiments in my master's thesis titled "Linguistic Structure Induction from Language Models" (https://arxiv.org/abs/2403.09714).
15
 
16
+ "omarmomen/tf_babylm_1" is the vanilla transformer encoder referred to in Chapter 5 (p. 59).
17
+
18
+ The model is trained on the BabyLM 10M dataset, with a RobertaTokenizer pretrained on the BabyLM 10M dataset with 16K tokens (https://huggingface.co/omarmomen/babylm_bpe_tokenizer_16k).
19