How to use this model directly from the
from transformers import AutoTokenizer, AutoModel tokenizer = AutoTokenizer.from_pretrained("jannesg/bertsson") model = AutoModel.from_pretrained("jannesg/bertsson")
The models are trained on:
Corpus size: Roughly 6B tokens.
The following models are currently available:
All models are cased and trained with whole word masking.
Stay tuned for evaluations.