🤗 Optimum

🤗 Optimum is an extension of 🤗 Transformers that provides a set of performance optimization tools to train and run models on targeted hardware with maximum efficiency.

The AI ecosystem evolves quickly, and more and more specialized hardware along with their own optimizations are emerging every day. As such, 🤗 Optimum enables developers to efficiently use any of these platforms with the same ease inherent to 🤗 Transformers.

🤗 Optimum is distributed as a collection of packages - check out the links below for an in-depth look at each one.

Optimum Graphcore

Train Transformers model on Graphcore IPUs, a completely new kind of massively parallel processor to accelerate machine intelligence.

Optimum Habana

Maximize training throughput and efficiency with Habana's Gaudi processor.

Optimum Intel

Accelerate Transformers and Diffusers models inference using OpenVINO and Intel Neural Compressor to optimize your model.

ONNX Runtime

Apply quantization and graph optimization to accelerate Transformers models training and inference with ONNX Runtime

Torch FX

Create and compose custom graph transformations to optimize PyTorch Transformers models with Torch FX

BetterTransformer

A one-liner integration to use PyTorch's BetterTransformer with Transformers models