runtime error

dbytes/cextension.py:34: UserWarning: The installed version of bitsandbytes was compiled without GPU support. 8-bit optimizers, 8-bit multiplication, and GPU quantization are unavailable.
  warn("The installed version of bitsandbytes was compiled without GPU support. "
/home/user/.local/lib/python3.10/site-packages/bitsandbytes/libbitsandbytes_cpu.so: undefined symbol: cadam32bit_grad_fp32
Downloading (…)okenizer_config.json: 0%| | 0.00/695 [00:00<?, ?B/s]
Downloading (…)okenizer_config.json: 100%|██████████| 695/695 [00:00<00:00, 6.36MB/s]
Downloading tokenizer.model: 0%| | 0.00/500k [00:00<?, ?B/s]
Downloading tokenizer.model: 100%|██████████| 500k/500k [00:00<00:00, 237MB/s]
Downloading (…)/main/tokenizer.json: 0%| | 0.00/1.84M [00:00<?, ?B/s]
Downloading (…)/main/tokenizer.json: 100%|██████████| 1.84M/1.84M [00:00<00:00, 64.0MB/s]
Downloading (…)cial_tokens_map.json: 0%| | 0.00/434 [00:00<?, ?B/s]
Downloading (…)cial_tokens_map.json: 100%|██████████| 434/434 [00:00<00:00, 2.65MB/s]
Downloading (…)lve/main/config.json: 0%| | 0.00/610 [00:00<?, ?B/s]
Downloading (…)lve/main/config.json: 100%|██████████| 610/610 [00:00<00:00, 4.39MB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 24, in <module>
    model = AutoModelForCausalLM.from_pretrained("llSourcell/medllama2_7b",quantization_config=nf4_config)
  File "/home/user/.local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 563, in from_pretrained
    return model_class.from_pretrained(
  File "/home/user/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2494, in from_pretrained
    raise ImportError(
ImportError: Using `load_in_8bit=True` requires Accelerate: `pip install accelerate` and the latest version of bitsandbytes `pip install -i https://test.pypi.org/simple/ bitsandbytes` or pip install bitsandbytes`
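The ImportError above names two packages, accelerate and bitsandbytes, and the earlier warning says the installed bitsandbytes build has no GPU support. A minimal sketch of a check that could run before the `from_pretrained` call is shown below. The helper names `missing_packages` and `cuda_available` are hypothetical and do not come from the log or from any library; the check only reports whether the packages can be found and whether PyTorch sees a CUDA device, and it cannot tell whether bitsandbytes itself was built with GPU support.

```python
import importlib.util

# Package names taken from the ImportError message in the log.
REQUIRED_PACKAGES = ("accelerate", "bitsandbytes")


def missing_packages(required=REQUIRED_PACKAGES, find_spec=importlib.util.find_spec):
    """Return the names in `required` that cannot be located for import.

    Hypothetical helper: `find_spec` is injectable only so the function
    can be exercised without changing the environment.
    """
    return [name for name in required if find_spec(name) is None]


def cuda_available():
    """Return torch.cuda.is_available(), or None if torch is not installed.

    Hypothetical helper: this reports what PyTorch sees, not how
    bitsandbytes was compiled.
    """
    if importlib.util.find_spec("torch") is None:
        return None
    import torch

    return torch.cuda.is_available()


if __name__ == "__main__":
    print("missing packages:", missing_packages())
    print("CUDA available:", cuda_available())
```

If `missing_packages()` returns an empty list and the error persists, the "compiled without GPU support" warning at the top of the log is the more likely lead to follow.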
