venkatasg
/

lil-bevo-x

Inference Endpoints

Model card Files Files and versions Community

venkatasg commited on Jul 21, 2023

Commit

ad6f442

·

1 Parent(s): de9e48f

Update README.md

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -2,6 +2,8 @@
 license: mit
 language:
 - en
 ---
 # Lil-Bevo-X
@@ -11,11 +13,11 @@ Lil-Bevo-X is UT Austin's submission to the BabyLM challenge, specifically the *
 [Link to GitHub Repo](https://github.com/venkatasg/Lil-Bevo)
 ## TLDR:
-- Unigram tokenizer trained on 100M BabyLM tokens plus MAESTRO dataset for a vocab size of 32k.
 - `deberta-base-v3` trained on mixture of MAESTRO and 100M tokens for 3 epochs.
 - Model continues training for 100,000 steps with 128 sequence length.
 - Model continues training for 65,000 steps with 512 sequence length.
 - Model is trained with targeted linguistic masking for 1 epoch.
-  This README will be updated with more details soon.

 license: mit
 language:
 - en
+tags:
+- babylm
 ---
 # Lil-Bevo-X
 [Link to GitHub Repo](https://github.com/venkatasg/Lil-Bevo)
 ## TLDR:
+- Unigram tokenizer trained on 10M BabyLM tokens plus MAESTRO dataset for a vocab size of 32k.
 - `deberta-base-v3` trained on mixture of MAESTRO and 100M tokens for 3 epochs.
 - Model continues training for 100,000 steps with 128 sequence length.
 - Model continues training for 65,000 steps with 512 sequence length.
 - Model is trained with targeted linguistic masking for 1 epoch.
+  This README will be updated with more details soon.