How to use this model directly from the
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("jannesg/bertsson")
model = AutoModel.from_pretrained("jannesg/bertsson")
How to clone the model repo
git lfs install
git clone https://huggingface.co/jannesg/bertsson
# if you want to clone without large files – just their pointers
# prepend your git clone with the following env var:
Unable to determine this model’s pipeline type. Check the
The models are trained on:
Corpus size: Roughly 6B tokens.
The following models are currently available:
All models are cased and trained with whole word masking.
Stay tuned for evaluations.