runtime error

| 2.09G/4.52G [00:15<00:14, 173MB/s] model-00005-of-00006.safetensors: 51%|█████ | 2.32G/4.52G [00:18<00:19, 114MB/s] model-00005-of-00006.safetensors: 63%|██████▎ | 2.85G/4.52G [00:19<00:09, 184MB/s] model-00005-of-00006.safetensors: 69%|██████▉ | 3.14G/4.52G [00:22<00:08, 160MB/s] model-00005-of-00006.safetensors: 74%|███████▍ | 3.37G/4.52G [00:24<00:08, 139MB/s] model-00005-of-00006.safetensors: 79%|███████▉ | 3.57G/4.52G [00:25<00:06, 146MB/s] model-00005-of-00006.safetensors: 93%|█████████▎| 4.20G/4.52G [00:26<00:01, 238MB/s] model-00005-of-00006.safetensors: 100%|█████████▉| 4.52G/4.52G [00:27<00:00, 166MB/s] Downloading shards: 83%|████████▎ | 5/6 [02:25<00:29, 29.18s/it] model-00006-of-00006.safetensors: 0%| | 0.00/524M [00:00<?, ?B/s] model-00006-of-00006.safetensors: 2%|▏ | 10.5M/524M [00:03<02:32, 3.37MB/s] model-00006-of-00006.safetensors: 100%|█████████▉| 524M/524M [00:03<00:00, 159MB/s] Downloading shards: 100%|██████████| 6/6 [02:29<00:00, 20.48s/it] Downloading shards: 100%|██████████| 6/6 [02:29<00:00, 24.89s/it] Traceback (most recent call last): File "/home/user/app/app.py", line 6, in <module> model = AutoModelForCausalLM.from_pretrained(peft_model_id, return_dict=True, load_in_8bit=True, load_in_8bit_fp32_cpu_offload=True, device_map='auto') File "/home/user/.pyenv/versions/3.10.13/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 566, in from_pretrained return model_class.from_pretrained( File "/home/user/.pyenv/versions/3.10.13/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3596, in from_pretrained model = cls(config, *model_args, **model_kwargs) TypeError: MixtralForCausalLM.__init__() got an unexpected keyword argument 'load_in_8bit_fp32_cpu_offload'

Container logs:

Fetching error logs...