Transformers documentation

You are viewing v4.21.1 version. A newer version v4.57.1 is available.

Auto Classes

In many cases, the architecture you want to use can be guessed from the name or the path of the pretrained model you supply to the from_pretrained() method. The auto classes do this for you: given the name or path of the pretrained weights, config, or vocabulary, they automatically retrieve the relevant model.

Instantiating one of AutoConfig, AutoModel, and AutoTokenizer will directly create an instance of the class for the relevant architecture. For instance:

from transformers import AutoModel
model = AutoModel.from_pretrained("bert-base-cased")

will create a model that is an instance of BertModel.

There is one class of AutoModel for each task, and for each backend (PyTorch, TensorFlow, or Flax).
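The per-task dispatch can be seen without downloading any weights by building models from a configuration instead of a checkpoint. The following is a minimal sketch for the PyTorch backend; the tiny BertConfig values are arbitrary choices made here to keep the example fast, and the resulting models are randomly initialized rather than pretrained.

```python
from transformers import (
    AutoModel,
    AutoModelForSequenceClassification,
    BertConfig,
    BertForSequenceClassification,
    BertModel,
)

# A deliberately tiny BERT configuration (illustrative sizes, not a real checkpoint).
config = BertConfig(
    vocab_size=100,
    hidden_size=32,
    num_hidden_layers=1,
    num_attention_heads=2,
    intermediate_size=64,
)

# The same config resolves to a different concrete class depending on the auto class.
base = AutoModel.from_config(config)
classifier = AutoModelForSequenceClassification.from_config(config)

print(type(base).__name__)
print(type(classifier).__name__)
```

Because from_config() only reads the configuration, no weights are loaded; use from_pretrained() when you need the pretrained parameters.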

Extending the Auto Classes

Each of the auto classes has a register method for extending it with your custom classes. For instance, if you have defined a custom model class NewModel, make sure you also have a NewModelConfig; you can then add both to the auto classes like this:

from transformers import AutoConfig, AutoModel

AutoConfig.register("new-model", NewModelConfig)
AutoModel.register(NewModelConfig, NewModel)

You will then be able to use the auto classes as you usually would!

If your NewModelConfig is a subclass of PretrainedConfig, make sure its model_type attribute is set to the same key you use when registering the config (here "new-model").

Likewise, if your NewModel is a subclass of PreTrainedModel, make sure its config_class attribute is set to the same class you use when registering the model (here NewModelConfig).
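The two requirements above can be put together in a small end-to-end sketch. The bodies of NewModelConfig and NewModel below (the hidden_size parameter and the single linear layer) are hypothetical placeholders invented for illustration; only the model_type and config_class attributes and the two register calls come from the text above. The example assumes the PyTorch backend.

```python
from torch import nn
from transformers import AutoConfig, AutoModel, PretrainedConfig, PreTrainedModel


class NewModelConfig(PretrainedConfig):
    # Must match the key passed to AutoConfig.register below.
    model_type = "new-model"

    def __init__(self, hidden_size=8, **kwargs):  # hidden_size is a made-up field
        super().__init__(**kwargs)
        self.hidden_size = hidden_size


class NewModel(PreTrainedModel):
    # Must match the config class passed to AutoModel.register below.
    config_class = NewModelConfig

    def __init__(self, config):
        super().__init__(config)
        # Placeholder architecture: a single square linear layer.
        self.linear = nn.Linear(config.hidden_size, config.hidden_size)

    def forward(self, inputs):
        return self.linear(inputs)


AutoConfig.register("new-model", NewModelConfig)
AutoModel.register(NewModelConfig, NewModel)

# The auto class now resolves the custom config to the custom model.
config = NewModelConfig(hidden_size=4)
model = AutoModel.from_config(config)
print(type(model).__name__)
```

Registration lives only in the running Python process, so the register calls need to run (for example at import time of your package) before the auto classes are used with the custom types.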

AutoConfig

class transformers.AutoConfig

from_pretrained

register

AutoTokenizer

class transformers.AutoTokenizer

from_pretrained

register

AutoFeatureExtractor

class transformers.AutoFeatureExtractor

from_pretrained

register

AutoProcessor

class transformers.AutoProcessor

from_pretrained

register

AutoModel

class transformers.AutoModel

from_config

from_pretrained

AutoModelForPreTraining

class transformers.AutoModelForPreTraining

from_config

from_pretrained

AutoModelForCausalLM

class transformers.AutoModelForCausalLM

from_config

from_pretrained

AutoModelForMaskedLM

class transformers.AutoModelForMaskedLM

from_config

from_pretrained

AutoModelForSeq2SeqLM

class transformers.AutoModelForSeq2SeqLM

from_config

from_pretrained

AutoModelForSequenceClassification

class transformers.AutoModelForSequenceClassification

from_config

from_pretrained

AutoModelForMultipleChoice

class transformers.AutoModelForMultipleChoice

from_config

from_pretrained

AutoModelForNextSentencePrediction

class transformers.AutoModelForNextSentencePrediction

from_config

from_pretrained

AutoModelForTokenClassification

class transformers.AutoModelForTokenClassification

from_config

from_pretrained

AutoModelForQuestionAnswering

class transformers.AutoModelForQuestionAnswering

from_config

from_pretrained

AutoModelForTableQuestionAnswering

class transformers.AutoModelForTableQuestionAnswering

from_config

from_pretrained

AutoModelForImageClassification

class transformers.AutoModelForImageClassification

from_config

from_pretrained

AutoModelForVision2Seq

class transformers.AutoModelForVision2Seq

from_config

from_pretrained

AutoModelForVisualQuestionAnswering

class transformers.AutoModelForVisualQuestionAnswering

from_config

from_pretrained

AutoModelForAudioClassification

class transformers.AutoModelForAudioClassification

from_config

from_pretrained

AutoModelForAudioFrameClassification