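How to use this model directly from the transformers library:

    from transformers import AutoModel, AutoTokenizer

    # Load the tokenizer and the pretrained weights from the Hugging Face Hub
    tokenizer = AutoTokenizer.from_pretrained("codegram/calbert-base-uncased")
    model = AutoModel.from_pretrained("codegram/calbert-base-uncased")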
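
Once the tokenizer and model are loaded, a common next step is turning Catalan sentences into fixed-size vectors. The following is a minimal sketch of one way to do that, not something taken from the CALBERT project itself: the helper names `mean_pool` and `embed` are hypothetical, and mean pooling over the last hidden state is an assumed choice of sentence representation. It also assumes PyTorch is installed; depending on your environment the tokenizer may additionally need the `sentencepiece` package.

```python
import torch
from transformers import AutoModel, AutoTokenizer

MODEL_NAME = "codegram/calbert-base-uncased"


def mean_pool(last_hidden_state, attention_mask):
    """Average the token vectors of each sentence, ignoring padding positions.

    Hypothetical helper: mean pooling is an assumption made for this sketch.
    """
    # (batch, seq_len) -> (batch, seq_len, 1) so it broadcasts over the hidden size
    mask = attention_mask.unsqueeze(-1).to(last_hidden_state.dtype)
    summed = (last_hidden_state * mask).sum(dim=1)
    # Guard against division by zero for an all-padding row
    counts = mask.sum(dim=1).clamp(min=1.0)
    return summed / counts


def embed(sentences, model_name=MODEL_NAME):
    """Return one vector per sentence (downloads the checkpoint on first use)."""
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()  # disable dropout for inference
    batch = tokenizer(sentences, padding=True, return_tensors="pt")
    with torch.no_grad():  # no gradients needed when only extracting features
        output = model(**batch)
    return mean_pool(output.last_hidden_state, batch["attention_mask"])
```

Calling `embed(["M'agrada molt aquest model.", "Bon dia!"])` should return a tensor with one row per input sentence. Other pooling strategies, such as taking the first token's vector, are equally plausible; which one works best depends on the downstream task.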
CALBERT is an open-source language model for Catalan based on the ALBERT architecture.
It is now available on Hugging Face in its base-uncased version, and was pretrained on the OSCAR dataset.
For further information or requests, please go to the GitHub repository.
| Model | Version | Training data |
|---|---|---|
| codegram/calbert-base-uncased | Base (uncased) | OSCAR (4.3 GB of text) |