Installation Guide

Welcome to the installation guide for the bitsandbytes library! This document provides step-by-step instructions to install bitsandbytes across various platforms and hardware configurations. The library primarily supports CUDA-based GPUs, but the team is actively working on enabling support for additional backends like CPU, AMD ROCm, Intel XPU, and Gaudi HPU.

CUDA
Multi-Backend Preview

CUDA

bitsandbytes is currently supported on NVIDIA GPUs with Compute Capability 5.0+. The library can be built using CUDA Toolkit versions as old as 11.6 on Windows and 11.4 on Linux.

Feature	CC Required	Example Hardware Requirement
LLM.int8()	7.5+	Turing (RTX 20 series, T4) or newer GPUs
8-bit optimizers/quantization	5.0+	Maxwell (GTX 900 series, TITAN X, M40) or newer GPUs
NF4/FP4 quantization	5.0+	Maxwell (GTX 900 series, TITAN X, M40) or newer GPUs

Support for Maxwell GPUs is deprecated and will be removed in a future release. For the best results, a Turing generation device or newer is recommended.

Installation via PyPI

This is the most straightforward and recommended installation option.

The currently distributed bitsandbytes packages are built with the following configurations:

OS	CUDA Toolkit	Host Compiler	Targets
Linux x86-64	11.8 - 12.6	GCC 11.2	sm50, sm60, sm75, sm80, sm86, sm89, sm90
Linux x86-64	12.8	GCC 11.2	sm75, sm80, sm86, sm89, sm90, sm100, sm120
Linux aarch64	11.8 - 12.6	GCC 11.2	sm75, sm80, sm90
Linux aarch64	12.8	GCC 11.2	sm75, sm80, sm90, sm100
Windows x86-64	11.8 - 12.6	MSVC 19.43+ (VS2022)	sm50, sm60, sm75, sm80, sm86, sm89, sm90
Windows x86-64	12.8	MSVC 19.43+ (VS2022)	sm75, sm80, sm86, sm89, sm90, sm100, sm120

Use pip or uv to install:

pip install bitsandbytes

Compile from source

Don’t hesitate to compile from source! The process is pretty straight forward and resilient. This might be needed for older CUDA Toolkit versions or Linux distributions, or other less common configurations.

For Linux and Windows systems, compiling from source allows you to customize the build configurations. See below for detailed platform-specific instructions (see the CMakeLists.txt if you want to check the specifics and explore some additional options):

Linux

Windows

Preview Wheels from main

If you would like to use new features even before they are officially released and help us test them, feel free to install the wheel directly from our CI (the wheel links will remain stable!):

Linux

Windows

Multi-Backend Preview

This functionality existed as an early technical preview and is not recommended for production use. We are in the process of upstreaming improved support for AMD and Intel hardware into the main project.

We provide an early preview of support for AMD and Intel hardware as part of a development branch.

Supported Backends

Backend	Supported Versions	Python versions	Architecture Support	Status
AMD ROCm	6.1+	3.10+	minimum CDNA - `gfx90a`, RDNA - `gfx1100`	Alpha
Intel CPU	v2.4.0+ (`ipex`)	3.10+	Intel CPU	Alpha
Intel GPU	v2.4.0+ (`ipex`)	3.10+	Intel GPU	Experimental
Ascend NPU	2.1.0+ (`torch_npu`)	3.10+	Ascend NPU	Experimental

For each supported backend, follow the respective instructions below:

Pre-requisites

To use this preview version of bitsandbytes with transformers, be sure to install:

pip install "transformers>=4.45.1"

AMD ROCm

Intel XPU

Installation

You can install the pre-built wheels for each backend, or compile from source for custom configurations.

Pre-built Wheel Installation (recommended)

Linux

Windows

Compile from Source

AMD ROCm

Intel CPU + GPU

Ascend NPU

< > Update on GitHub

Bitsandbytes

Installation Guide

Table of Contents

CUDA

Installation via PyPI

Compile from source

Preview Wheels from main

Multi-Backend Preview

Supported Backends

Pre-requisites

Installation

Pre-built Wheel Installation (recommended)

Compile from Source

AMD GPU