Transformers documentation

Auto Classes

Transformers

You are viewing v4.38.0 version. A newer version v5.0.0rc0 is available.

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

to get started

Auto Classes

In many cases, the architecture you want to use can be guessed from the name or the path of the pretrained model you are supplying to the from_pretrained() method. AutoClasses are here to do this job for you so that you automatically retrieve the relevant model given the name/path to the pretrained weights/config/vocabulary.

Instantiating one of AutoConfig, AutoModel, and AutoTokenizer will directly create a class of the relevant architecture. For instance

model = AutoModel.from_pretrained("google-bert/bert-base-cased")

will create a model that is an instance of BertModel.

There is one class of AutoModel for each task, and for each backend (PyTorch, TensorFlow, or Flax).

Extending the Auto Classes

Each of the auto classes has a method to be extended with your custom classes. For instance, if you have defined a custom class of model NewModel, make sure you have a NewModelConfig then you can add those to the auto classes like this:

from transformers import AutoConfig, AutoModel

AutoConfig.register("new-model", NewModelConfig)
AutoModel.register(NewModelConfig, NewModel)

You will then be able to use the auto classes like you would usually do!

If your NewModelConfig is a subclass of PretrainedConfig, make sure its model_type attribute is set to the same key you use when registering the config (here "new-model").

Likewise, if your NewModel is a subclass of PreTrainedModel, make sure its config_class attribute is set to the same class you use when registering the model (here NewModelConfig).

Transformers

Auto Classes

Extending the Auto Classes

AutoConfig

class transformers.AutoConfig

from_pretrained

register

AutoTokenizer

class transformers.AutoTokenizer

from_pretrained

register

AutoFeatureExtractor

class transformers.AutoFeatureExtractor

from_pretrained

register

AutoImageProcessor

class transformers.AutoImageProcessor

from_pretrained

register

AutoProcessor

class transformers.AutoProcessor

from_pretrained

register

Generic model classes

AutoModel

class transformers.AutoModel

from_config

from_pretrained

TFAutoModel

class transformers.TFAutoModel

from_config

from_pretrained

FlaxAutoModel

class transformers.FlaxAutoModel

from_config

from_pretrained

Generic pretraining classes

AutoModelForPreTraining

class transformers.AutoModelForPreTraining

from_config

from_pretrained

TFAutoModelForPreTraining

class transformers.TFAutoModelForPreTraining

from_config

from_pretrained

FlaxAutoModelForPreTraining

class transformers.FlaxAutoModelForPreTraining

from_config

from_pretrained

Natural Language Processing

AutoModelForCausalLM

class transformers.AutoModelForCausalLM

from_config

from_pretrained

TFAutoModelForCausalLM

class transformers.TFAutoModelForCausalLM

from_config

from_pretrained

FlaxAutoModelForCausalLM

class transformers.FlaxAutoModelForCausalLM

from_config

from_pretrained

AutoModelForMaskedLM

class transformers.AutoModelForMaskedLM

from_config

from_pretrained

TFAutoModelForMaskedLM

class transformers.TFAutoModelForMaskedLM

from_config

from_pretrained

FlaxAutoModelForMaskedLM

class transformers.FlaxAutoModelForMaskedLM

from_config

from_pretrained

AutoModelForMaskGeneration

class transformers.AutoModelForMaskGeneration

TFAutoModelForMaskGeneration

class transformers.TFAutoModelForMaskGeneration

AutoModelForSeq2SeqLM

class transformers.AutoModelForSeq2SeqLM