🤗 Optimum
🤗 Optimum is an extension of 🤗 Transformers that provides a set of performance optimization tools to train and run models on targeted hardware with maximum efficiency.
The AI ecosystem evolves quickly, and new specialized hardware, each with its own optimizations, emerges all the time. 🤗 Optimum lets developers use any of these platforms efficiently, with the same ease as 🤗 Transformers.
🤗 Optimum is distributed as a collection of packages. Check out the links below for an in-depth look at each one.
Train Transformers models on Graphcore IPUs, a completely new kind of massively parallel processor built to accelerate machine intelligence.
Maximize training throughput and efficiency with Habana's Gaudi processor.
Optimize your models and accelerate inference for Transformers and Diffusers models using OpenVINO and Intel Neural Compressor.
Apply quantization and graph optimization to accelerate training and inference of Transformers models with ONNX Runtime.
Create and compose custom graph transformations to optimize PyTorch Transformers models with Torch FX.
A one-liner integration to use PyTorch's BetterTransformer with Transformers models.
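To give a sense of what "the same ease inherent to 🤗 Transformers" can look like in practice, here is a minimal sketch using the ONNX Runtime package. It assumes `optimum[onnxruntime]` and `transformers` are installed; the helper name `build_onnx_pipeline` and the default checkpoint are illustrative choices, not part of the library.

```python
def build_onnx_pipeline(model_id="distilbert-base-uncased-finetuned-sst-2-english"):
    """Export a Hub checkpoint to ONNX and wrap it in a Transformers pipeline.

    `build_onnx_pipeline` is a hypothetical helper for this sketch, and the
    default `model_id` is only an example checkpoint.
    """
    # Imported lazily so this module loads even without the optional packages.
    from optimum.onnxruntime import ORTModelForSequenceClassification
    from transformers import AutoTokenizer, pipeline

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # export=True converts the PyTorch checkpoint to ONNX when loading.
    model = ORTModelForSequenceClassification.from_pretrained(model_id, export=True)
    # The ONNX Runtime model plugs into the usual Transformers pipeline API.
    return pipeline("text-classification", model=model, tokenizer=tokenizer)
```

Calling `build_onnx_pipeline()("Optimum makes inference faster.")` should then return the usual text-classification output, with ONNX Runtime doing the inference. The other packages are meant to follow a similar pattern of swapping in a hardware-specific model or trainer class; see each package's documentation for the exact classes.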