🤗 Optimum Habana
🤗 Optimum Habana is the interface between the 🤗 Transformers and 🤗 Diffusers libraries and Habana's Gaudi processor (HPU). It provides a set of tools that enable easy model loading, training and inference in single- and multi-HPU settings for various downstream tasks, as shown in the table below.
HPUs offer fast model training and inference as well as a great price-performance ratio. Check out this blog post about BERT pre-training and this article benchmarking Habana Gaudi2 versus Nvidia A100 GPUs for concrete examples. If you are not familiar with HPUs, we recommend you take a look at our conceptual guide.
The table below shows which model architectures, tasks and device distributions are currently supported for 🤗 Optimum Habana:
Architecture | Single Card | Multi Card | DeepSpeed | |
---|---|---|---|---|
BERT | β | β | β | |
RoBERTa | β | β | β | |
ALBERT | β | β | β | |
DistilBERT | β | β | β | |
GPT2 | β | β | β | |
T5 | β | β | β | |
ViT | β | β | β | |
Swin | β | β | β | |
Wav2Vec2 | β | β | β | |
Stable Diffusion | β | β | β | |
CLIP | β | β | β | |
BLOOM(Z) | β | β | β | |
StarCoder | β | β | β | |
ESMFold | β | β | β | |
GPT-J | β | β | β | |
GPT-NeoX | β | β | β | |
OPT | β | β | β | |
BridgeTower | β | β | β | |
Llama | β | β | β | |
Other models and tasks supported by the 🤗 Transformers library may also work. You can refer to this page for examples of how to use them with 🤗 Optimum Habana. In addition, the Quickstart explains how to modify any example from the 🤗 Transformers library to make it work with 🤗 Optimum Habana.
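As a rough illustration of the kind of change involved, the sketch below replaces the `Trainer` and `TrainingArguments` classes of a 🤗 Transformers script with their Gaudi counterparts. It assumes the `GaudiTrainer` and `GaudiTrainingArguments` classes exposed by `optimum.habana` and their `use_habana`, `use_lazy_mode` and `gaudi_config_name` arguments. The helper names `gaudi_training_kwargs` and `build_gaudi_trainer`, as well as the `Habana/bert-base-uncased` configuration name, are illustrative choices rather than part of the library, so check them against the Quickstart and your installed version.

```python
from typing import Any, Dict


def gaudi_training_kwargs(output_dir: str, gaudi_config_name: str) -> Dict[str, Any]:
    """Collect the HPU-specific arguments added on top of a regular Trainer setup.

    Hypothetical helper, used here only to make the differences explicit.
    """
    return {
        "output_dir": output_dir,
        "use_habana": True,        # run on HPU
        "use_lazy_mode": True,     # lazy-mode execution; set to False for eager mode
        "gaudi_config_name": gaudi_config_name,  # Gaudi configuration (Hub name or local path)
    }


def build_gaudi_trainer(
    model,
    train_dataset,
    output_dir: str = "./out",
    gaudi_config_name: str = "Habana/bert-base-uncased",  # assumed example configuration
):
    """Build a GaudiTrainer where a Transformers script would build a Trainer."""
    # Imported lazily so this sketch can be imported on a machine
    # that does not have optimum-habana installed.
    from optimum.habana import GaudiTrainer, GaudiTrainingArguments

    args = GaudiTrainingArguments(**gaudi_training_kwargs(output_dir, gaudi_config_name))
    return GaudiTrainer(model=model, args=args, train_dataset=train_dataset)
```

On a machine with Gaudi hardware and `optimum-habana` installed, `build_gaudi_trainer(model, train_dataset).train()` would then stand in for the usual `Trainer(...).train()` call; the rest of the script (model loading, tokenization, datasets) is left unchanged in this sketch.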
Learn the basics and become familiar with training transformers on HPUs with 🤗 Optimum. Start here if you are using 🤗 Optimum Habana for the first time!
Practical guides to help you achieve a specific goal. Take a look at these guides to learn how to use 🤗 Optimum Habana to solve real-world problems.
High-level explanations for building a better understanding of important topics such as HPUs.
Technical descriptions of how the Habana classes and methods of 🤗 Optimum Habana work.