Question about vocab size

#1
by johnsongwx

Hi! This is great work and I am trying to follow it.
When applying the model in another framework, I discovered that the vocab_size in the config.json file is 30522, while the vocab.txt file contains only 28895 lines (words). Shouldn't these two numbers be the same? Or am I misunderstanding something?
Looking forward to your reply. Thanks a lot!

Microsoft org
• edited Jun 11, 2022

Thanks for your comment. When checking the size of the embedding matrix (cc @nbroad):

from transformers import BertModel

model = BertModel.from_pretrained("microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract")
print(model.embeddings.word_embeddings.weight.shape)

it does print a shape of torch.Size([30522, 768]).
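For reference, the tokenizer side can be checked the same way. A quick sketch, assuming the tokenizer is loaded from the same repo; its vocab_size reflects the entries read from vocab.txt:

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract")

# vocab_size counts the entries loaded from vocab.txt
print(tokenizer.vocab_size)  # expected: 28895, matching the line count you report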

I guess that the last rows of the embedding matrix (30522 - 28895 = 1627 of them) are actually never used and could be removed from the model.

However, simply updating the vocab_size attribute of the config will raise an error, since the updated size no longer matches the size of the embedding matrix. One should therefore update the vocab_size attribute and the embedding matrix at the same time.
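For what it's worth, transformers ships a helper, resize_token_embeddings, that performs both updates together. A minimal sketch, assuming 28895 (the line count of vocab.txt above) is the intended size:

from transformers import BertModel

model = BertModel.from_pretrained("microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract")

# Truncates the embedding matrix and updates config.vocab_size in one call.
model.resize_token_embeddings(28895)

print(model.config.vocab_size)                        # 28895
print(model.embeddings.word_embeddings.weight.shape)  # torch.Size([28895, 768])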
