Pretraining data cutoff?

#17
by ytsaig - opened

Hello,

What is the date cutoff for the data used for pretraining the model? The associated paper says:
"Although ModernBERT showcase strong results across the board, it should be noted that an important factor in its performance is TREC-COVID (Voorhees et al., 2021), potentially showcasing the benefits of ModernBERT being trained with a more recent knowledge cutoff than most existing encoders. "

However there's no explicit mention of the cutoff date.

Thank you!

Sign up or log in to comment