Use BioGptModel to get features

#19
by pete88b - opened

We can't use BioGptForCausalLM to get the features as it returns logits for next token prediction. I'm pretty sure we want BioGptModel to give us last_hidden_state

Hi,

You can just load a BioGptModel from this checkpoint, it will just give you a warning that the head on top isn't initialized but you can use it for feature extraction:

from transformers import BioGptModel

model = BioGptModel.from_pretrained("microsoft/biogpt")

Hi Niels,

so my suggested change to the docs is good to merge (o:

and while i have your ear - thank you so much for notebooks like NER with BERT - really helpful

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment