Optimum documentation

πŸ€— Optimum notebooks

You are viewing main version, which requires installation from source. If you'd like regular pip install, checkout the latest stable version (v1.17.1).
Hugging Face's logo
Join the Hugging Face community

and get access to the augmented documentation experience

to get started

πŸ€— Optimum notebooks

You can find here a list of the notebooks associated with each accelerator in πŸ€— Optimum.

Optimum Habana

Notebook Description Colab Studio Lab
How to use DeepSpeed to train models with billions of parameters on Habana Gaudi Show how to use DeepSpeed to pre-train/fine-tune the 1.6B-parameter GPT2-XL for causal language modeling on Habana Gaudi. Open in Colab Open in AWS Studio

Optimum Intel

OpenVINO

Notebook Description Colab Studio Lab
How to run inference with OpenVINO Explains how to export your model to OpenVINO and run inference with OpenVINO Runtime on various tasks Open in Colab Open in AWS Studio
How to quantize a question answering model with NNCF Show how to apply post-training quantization on a question answering model using NNCF and to accelerate inference with OpenVINO Open in Colab Open in AWS Studio
Compare outputs of a quantized Stable Diffusion model with its full-precision counterpart Show how to load and compare outputs from two Stable Diffusion models with different precision Open in Colab Open in AWS Studio

Neural Compressor

Notebook Description Colab Studio Lab
How to quantize a model with Intel Neural Compressor for text classification Show how to apply quantization while training your model using Intel Neural Compressor for any GLUE task. Open in Colab Open in AWS Studio

Optimum ONNX Runtime

Notebook Description Colab Studio Lab
How to quantize a model with ONNX Runtime for text classification Show how to apply static and dynamic quantization on a model using ONNX Runtime for any GLUE task. Open in Colab Open in AWS Studio
How to fine-tune a model for text classification with ONNX Runtime Show how to DistilBERT model on GLUE tasks using ONNX Runtime. Open in Colab Open in AWS Studio
How to fine-tune a model for summarization with ONNX Runtime Show how to fine-tune a T5 model on the BBC news corpus. Open in Colab Open in AWS Studio
How to fine-tune DeBERTa for question-answering with ONNX Runtime Show how to fine-tune a DeBERTa model on the squad. Open in Colab Open in AWS Studio