Models

Natural Language Processing

The following classes are available for the following natural language processing tasks.

OVModelForCausalLM

class optimum.intel.OVModelForCausalLM

( model: Model config: PretrainedConfig = None device: str = 'CPU' dynamic_shapes: bool = True ov_config: typing.Union[typing.Dict[str, str], NoneType] = None model_save_dir: typing.Union[str, pathlib.Path, tempfile.TemporaryDirectory, NoneType] = None quantization_config: typing.Union[optimum.intel.openvino.configuration.OVWeightQuantizationConfig, typing.Dict, NoneType] = None **kwargs )

Parameters

model (openvino.runtime.Model) — is the main class used to run OpenVINO Runtime inference.
config (transformers.PretrainedConfig) — PretrainedConfig is the Model configuration class with all the parameters of the model. Initializing with a config file does not load the weights associated with the model, only the configuration. Check out the ~intel.openvino.modeling.OVBaseModel.from_pretrained method to load the model weights.
device (str, defaults to "CPU") — The device type for which the model will be optimized for. The resulting compiled model will contains nodes specific to this device.
dynamic_shapes (bool, defaults to True) — All the model’s dimension will be set to dynamic when set to True. Should be set to False for the model to not be dynamically reshaped by default.
ov_config (Optional[Dict], defaults to None) — The dictionnary containing the informations related to the model compilation.
compile (bool, defaults to True) — Disable the model compilation during the loading step when set to False. Can be useful to avoid unnecessary compilation, in the case where the model needs to be statically reshaped, the device modified or if FP16 conversion is enabled.

OpenVINO Model with a causal language modeling head on top (linear layer with weights tied to the input embeddings).

This model inherits from optimum.intel.openvino.modeling.OVBaseModel. Check the superclass documentation for the generic methods the library implements for all its model (such as downloading or saving)

Optimum

Models

Natural Language Processing

OVModelForCausalLM

class optimum.intel.OVModelForCausalLM

forward

generate

OVModelForMaskedLM

class optimum.intel.OVModelForMaskedLM

forward

OVModelForSeq2SeqLM

class optimum.intel.OVModelForSeq2SeqLM

forward

OVModelForQuestionAnswering

class optimum.intel.OVModelForQuestionAnswering

forward

OVModelForSequenceClassification

class optimum.intel.OVModelForSequenceClassification

forward

OVModelForTokenClassification

class optimum.intel.OVModelForTokenClassification

forward

Audio

OVModelForAudioClassification

class optimum.intel.OVModelForAudioClassification

forward

OVModelForAudioFrameClassification

class optimum.intel.OVModelForAudioFrameClassification

forward

OVModelForCTC

class optimum.intel.OVModelForCTC

forward

OVModelForAudioXVector

class optimum.intel.OVModelForAudioXVector

forward

OVModelForSpeechSeq2Seq

class optimum.intel.OVModelForSpeechSeq2Seq

forward

Computer Vision

OVModelForImageClassification

class optimum.intel.OVModelForImageClassification

forward

Multimodal

OVModelForVision2Seq

class optimum.intel.OVModelForVision2Seq

forward

OVModelForPix2Struct

class optimum.intel.OVModelForPix2Struct

forward

Custom Tasks

OVModelForCustomTasks

class optimum.intel.OVModelForCustomTasks

forward

OVModelForFeatureExtraction

class optimum.intel.OVModelForFeatureExtraction

forward

Quantization

OVQuantizer

class optimum.intel.OVQuantizer

get_calibration_dataset

quantize