LTG-BERT for the BabyLM challenge

This is the LTG-BERT baseline model trained on the 100M-word BabyLM challenge dataset.

