Apr 7, 2023

I have already installed Accelerate, made sure it was in the PATH, yet I keep getting this message when I try to load the model. Also, I tried installing bitsandby, but after I did the webui wouldn't work at all.

Dorjzodovsuren

May 7, 2023

you need to restart kernel

appleszapples

Jul 23, 2023

same

shahabty

Aug 1, 2023

I have the same issue but I found no solution for it. Is there any solution?

JohnLaw

Aug 2, 2023

same

vishyrjun

Aug 2, 2023

•

edited Aug 2, 2023

I was trying this in Google Colab and keep getting this error as well. I ensured accelerate and bitsandbytes are installed. still keeps getting this issue. Then realized i was having a particular line in my train command that was causing this issue

autotrain llm --train --project_name '<project_name>' \
--model TinyPixel/Llama-2-7B-bf16-sharded \
--data_path timdettmers/openassistant-guanaco \
--text_column text \
--use_int4 \
--use_peft \
--learning_rate 2e-4 \
--train_batch_size 2 \
--num_train_epochs 3 \
--trainer sft \
--model_max_length 2048 \
--push_to_hub \
--repo_id <repo_id>/<project_name> \
--block_size 2048 > training.log &

After i removed --use_int4 line and executed, the issue got resolved. I hope this helps. Please ensure to use '\' at the end of everyline incase of using this command to train.

Thanks.

apshah

Aug 3, 2023

having this issue as well, tried restarting the kernel like a 1000 times, still didn't work

shahabty

Aug 3, 2023

Here is the solution that worked for me:

Install Nvidia docker + docker.
Download an nvidia PyTorch 2.0 docker image with Cuda 12.
Create and execute a container.
Install all packages within the container and run your code.
(you might still need to try different version of dependencies but it finally worked for me)

nehadhirmiz

Aug 18, 2023

•

edited Aug 18, 2023

That was not the issue. My apologies >>>

In the import_utils.py the code used to check if package exist does not work for all packages.

TODO: This doesn't work for all packages (`bs4`, `faiss`, etc.) Talk to Sylvain to see how to do with it better.

def _is_package_available(pkg_name: str, return_version: bool = False) -> Union[Tuple[bool, str], bool]:
# Check we're not importing a "pkg_name" directory somewhere but the actual library by trying to grab the version
package_exists = importlib.util.find_spec(pkg_name) is not None
package_version = "N/A"
if package_exists:
try:
package_version = importlib.metadata.version(pkg_name)
package_exists = True
except importlib.metadata.PackageNotFoundError:
package_exists = False
logger.debug(f"Detected {pkg_name} version {package_version}")
if return_version:
return package_exists, package_version
else:
return package_exists

pham-llm

Sep 13, 2023

Did this work for anyone? I am facing the same issue

Gidz

Sep 15, 2023

•

edited Sep 15, 2023

same issue

I think there might be an underlying problem with bitsandbytes.

I'm using slightly different scenario, but the same library and same results..
I'm on on a cpu only and been relying on tutorials from:
https://huggingface.co/blog/4bit-transformers-bitsandbytes
https://huggingface.co/docs/transformers/main_classes/quantization#general-usage

$ sudo pip install bitsandbytes accelerate transformers

'>>> from accelerate import Accelerator
'>>> from transformers import AutoModelForCausalLM

'>>> path = Path('/models/summarization/bart-large-cnn')
'>>> model_8bit = AutoModelForCausalLM.from_pretrained(path, load_in_8bit=True, device_map="auto")
'>>> model_4bit = AutoModelForCausalLM.from_pretrained(path, load_in_4bit=True, device_map="auto")

ImportError: Using load_in_8bit=True requires Accelerate: pip install accelerate and the latest version of bitsandbytes pip install -i https://test.pypi.org/simple/ bitsandbytes or pip install bitsandbytes`

Gidz

Sep 15, 2023

after more digging around, you have to downgrade your version of transformers > pip install transformers==4.32.0
that enabled the load_in_8bit to be recognised, but still doesn't work for CPU.. accepts GPU only.

zaursamedov1

Sep 17, 2023

I'm getting this error too
"ImportError: Using load_in_8bit=True requires Accelerate: pip install accelerate and the latest version of bitsandbytes pip install -i https://test.pypi.org/simple/ bitsandbytes or pip install bitsandbytes`" ~ while using AutoModelCasualLM.
downgrading transformer didn't workout, installing/upgrading accelerate and bitsandbytes either didn't work. I'm using vscode on Mac m2.

vikramnitin9

Sep 17, 2023

You might also need scipy.

anudeepadi

Sep 21, 2023

I downgraded transformers library to version 4.30 using the following command:
pip install transformers==4.30
Then I restarted the kernel it worked

mani22

Sep 22, 2023

Worked after downgrading transformers to 4.30.

bharatcoder

Sep 29, 2023

Was getting the same issue, resolved after downgrading transformers.
pip install transformers==4.30

nss-ysasaki

Oct 3, 2023

•

edited Oct 3, 2023

It turns out that the ImportError: Using load_in_8bit=True requires Accelerate... error message is shown when:

The accelerate module is not found,
The bitsandbytes module is not found, or
torch does not recognize CUDA (PR).

If you can import both accelerate and bitsandbytes and still get this error, it might be that PyTorch is unable to see CUDA.

You can check the CUDA availability with:

torch.cuda.is_available()

If this evaluates to False, you might want to head to StackOverflow to see why PyTorch cannot recognize CUDA.

(In my case, Docker misconfiguration prevented CUDA from loading, which resulted in nvidia-smi showing CUDA Version: ERR!, causing the cryptic ImportError...)

programmeurke

Oct 3, 2023

Hi, I got the same error using a wrapper library called Ludwig.
It had a configuration parameter
quantization: bits: 4
By removing this configuration setting, the error disappeared.

IntelliDynamics

Oct 4, 2023

•

edited Oct 4, 2023

At first, I installed the current version of Accelerate and had the error. I then backdated by installing an old version of Accelerate which worked, but it didn't support another pkg, so I installed the current version again, and it worked. Behavior like this suggests there is a history of versions that are required, with lurking version control issues in Accelerate and/or one of the packages it depends on. This is a mess, but I'm finally running after many hours.

okamirvs

Oct 8, 2023

This worked for me:

pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
python -m pip install bitsandbytes --prefer-binary --extra-index-url=https://jllllll.github.io/bitsandbytes-windows-webui
pip install transformers==4.34.0
pip install trl==0.7.1
pip install datasets==2.14.5

Basically you have to use that bitsandbytes instead of pip install bitsandbytes

ZRuiii

Oct 19, 2023

I guess this can be a bug while importing packages. I have met the same problems, and add from peft import PeftModel, PeftConfig at the beginning worked for me.

dgkd

Oct 19, 2023

I pip install requirements==4.28.0，and it works! I found in environments installation,the requirements.txt shows transformers>=4.28.0,so I did so.

elnaram

Nov 3, 2023

I used llm_int8_enable_fp32_cpu_offload=True instead of load_in_8bit=True, with transformer version 4.30, and it worked.

aryanjangid

Nov 4, 2023

I tried all the above solutions, but still getting this error
Here is my model.py code

from config import config
from prompts import get_vlm_prompt, get_llm_prompt

import torch

from transformers import (
BitsAndBytesConfig,
InstructBlipProcessor,
InstructBlipForConditionalGeneration,
)

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
double_quant_config = BitsAndBytesConfig(load_in_4bit=True,
bnb_4bit_quant_type="nf4",
bnb_4bit_use_double_quant=True,
bnb_4bit_compute_dtype=torch.bfloat16)

Here is my configuration (requirements.txt)

fastapi==0.103.2
langchain==0.0.311
multion==0.2.2
openai==0.27.10
Pillow==10.0.1
pydantic==2.4.2
python-dotenv==1.0.0
torch==1.13.1
transformers==4.33.3
sentencepiece==0.1.99
accelerate==0.23.0
bitsandbytes==0.41.1
pydantic-settings==2.0.3
python-multipart==0.0.6

suryasinghmv

Nov 8, 2023

I encountered a similar problem where I was facing an ImportError with the message "Using load_in_8bit=True requires Accelerate."

To resolve this issue, I checked my transformers version, which was initially at 4.30.2. After downgrading it to version 4.30, the problem was successfully resolved.

programmeurke

Nov 8, 2023

Anyone here with a Mac with M2 chip? That's apparently cause of my problems. (everything seems to be made for NVidia GPUs)

Amirkhalesi

Nov 12, 2023

Couldn't find a workaround for CPU yet, but worked with transformers==4.35.0 on CUDA

sherbika

Nov 17, 2023

I am using an apple M2 chip and can't get around this error

Petrux99

Nov 24, 2023

I'm on an M1 and also can't get this to work. I dug around a bit quickly in the Transformers source code and found this which seems to indicate there is no hope to get this working for us Mac users:

def is_bitsandbytes_available():
    if not is_torch_available():
        return False

    # bitsandbytes throws an error if cuda is not available
    # let's avoid that by adding a simple check
    import torch

    return _bitsandbytes_available and torch.cuda.is_available()

Note the requirement for torch.cuda.is_available()

Also, bitsandbytes seems to officially only support CUDA, with this issue about supporting MPS being open and not acted upon: https://github.com/TimDettmers/bitsandbytes/issues/252

Apologies if any of the above analysis is wrong. I'm pretty new to this all.

DracoDev

Nov 30, 2023

This is the kind of error that can occur when downgrading transformers to accomodate this error on other new models like Zephry-7b-beta based on mistral.

----> 8 llm = HuggingFaceLLM(
9 model_name="HuggingFaceH4/zephyr-7b-beta",
10 tokenizer_name="HuggingFaceH4/zephyr-7b-beta",

/usr/local/lib/python3.10/dist-packages/transformers/models/auto/configuration_auto.py in getitem(self, key)
669 return self._extra_content[key]
670 if key not in self._mapping:
--> 671 raise KeyError(key)
672 value = self._mapping[key]
673 module_name = model_type_to_module_name(key)
KeyError: 'mistral'

segestic

Dec 1, 2023

If using Colab or any other notebooks, Ensure to change from CPU to GPU. This solved this error in my case!

poundian

Dec 2, 2023

@vishyrjun 's answer was the solution for me, using AutoTrain Advanced hosted UI.

JohnLaw

Dec 4, 2023

I was trying this in Google Colab and keep getting this error as well. I ensured accelerate and bitsandbytes are installed. still keeps getting this issue. Then realized i was having a particular line in my train command that was causing this issue

autotrain llm --train --project_name '<project_name>' \
--model TinyPixel/Llama-2-7B-bf16-sharded \
--data_path timdettmers/openassistant-guanaco \
--text_column text \
--use_int4 \
--use_peft \
--learning_rate 2e-4 \
--train_batch_size 2 \
--num_train_epochs 3 \
--trainer sft \
--model_max_length 2048 \
--push_to_hub \
--repo_id <repo_id>/<project_name> \
--block_size 2048 > training.log &

After i removed --use_int4 line and executed, the issue got resolved. I hope this helps. Please ensure to use '\' at the end of everyline incase of using this command to train.

Thanks.

Thanks! it works for me in this case in Jupyter lab:

quantization_config = BitsAndBytesConfig(
# load_in_4bit=True,
bnb_4bit_compute_dtype=torch.float16,
bnb_4bit_quant_type="nf4",
bnb_4bit_use_double_quant=True,
)

serkanarslan

Dec 10, 2023

Hello Everyone,

I was getting the same error and thought the problem might be related to pytorch.

https://stackoverflow.com/questions/60987997/why-torch-cuda-is-available-returns-false-even-after-installing-pytorch-with the solution here worked for me.

After checking the cuda with torch.zeros(1).cuda(), I received the error that the video card was not up to date, and after installing the updated video card driver, the problem disappeared.

Marios88

Dec 21, 2023

!pip install transformers==4.34.0 worked for me in Google Colab GPU

sumittyagi25

Dec 29, 2023

I am using Mac M2 chip.
transformers=4.30.0 worked for me,

programmeurke

Jan 3

•

edited Jan 3

@sumittyagi25 thanks for sharing ! Are you also using the M2 chip with PyTorch, or the CPU? Do you use model.to('mps')? Or can you share an example script ? Thanks

Naveen2000

Feb 18

i installed 4.30 but still it is snot working

carolher

Mar 27

•

edited Mar 27

@sumittyagi25 thank you! this worked for me

ritwikm

Apr 2

@apshah If the problem persists even after restarting the colab session, then make sure you are running your code on the GPU. This line in transformers library needs cuda enabled environment for accelerate to be loaded.

anon8231489123
/

vicuna-13b-GPTQ-4bit-128g

I keep getting this: ImportError: Using `load_in_8bit=True` requires Accelerate

TODO: This doesn't work for all packages (`bs4`, `faiss`, etc.) Talk to Sylvain to see how to do with it better.

I keep getting this: ImportError: Using `load_in_8bit=True` requires Accelerate

TODO: This doesn't work for all packages (bs4, faiss, etc.) Talk to Sylvain to see how to do with it better.

TODO: This doesn't work for all packages (`bs4`, `faiss`, etc.) Talk to Sylvain to see how to do with it better.