cambridge-climb (CLIMB)

Organization Card

This repository is Cambridge University NLP's submission to the 2023 BabyLM Challenge (CoNLL workshop).

Our approach experiments with the following three variants of cognitively-motivated curriculum learning and analyze their effect on the performance of the model on linguistic evaluation tasks.

vocabulary curriculum we analyze methods for constraining the vocabulary in the early stages of training to simulate cognitively more plausible learning curves.
data curriculum we vary the order of the training instances based on i) infant-inspired expectations and ii) the learning behaviour of the model
objective curriculum we explore different variations of combining the conventional masked language modelling task with a more coarse-grained word class prediction task to reinforce linguistic generalization capabilities.

Overall, we find that various curriculum learning settings outperform our baseline in linguistic tasks. We moreover find that careful selection of model architecture, and training hyper-parameters yield substantial improvements over the default baselines provided by the BabyLM challenge.

models 8

CLIMB

AI & ML interests

models 8

cambridge-climb/objective_curriculum-roberta_pre_layer_norm-model

cambridge-climb/combination-roberta_pre_layer_norm-model

cambridge-climb/data_curriculum-roberta_pre_layer_norm-model

cambridge-climb/vocabulary_curriculum-roberta_pre_layer_norm-model

cambridge-climb/CamBabyTokenizer-8192

cambridge-climb/CamBabyTokenizer-32768

cambridge-climb/CamBabyTokenizer-16384

cambridge-climb/baseline-roberta_pre_layer_norm-model

datasets 1

cambridge-climb/BabyLM

AI & ML interests

Team members 7

models 8 Sort: Recently updated

datasets 1

models 8