runtime error

MB/s] model.safetensors: 97%|█████████▋| 23.1G/23.8G [02:46<00:04, 156MB/s] model.safetensors: 98%|█████████▊| 23.3G/23.8G [02:47<00:03, 155MB/s] model.safetensors: 99%|█████████▊| 23.5G/23.8G [02:48<00:01, 174MB/s] model.safetensors: 100%|█████████▉| 23.8G/23.8G [02:49<00:00, 140MB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 6, in <module> model = AutoModelForCausalLM.from_pretrained(model_name_or_path, File "/home/user/.local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 566, in from_pretrained return model_class.from_pretrained( File "/home/user/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3535, in from_pretrained model = quantizer.convert_model(model) File "/home/user/.local/lib/python3.10/site-packages/optimum/gptq/quantizer.py", line 229, in convert_model self._replace_by_quant_layers(model, layers_to_be_replaced) File "/home/user/.local/lib/python3.10/site-packages/optimum/gptq/quantizer.py", line 298, in _replace_by_quant_layers self._replace_by_quant_layers(child, names, name + "." + name1 if name != "" else name1) File "/home/user/.local/lib/python3.10/site-packages/optimum/gptq/quantizer.py", line 298, in _replace_by_quant_layers self._replace_by_quant_layers(child, names, name + "." + name1 if name != "" else name1) File "/home/user/.local/lib/python3.10/site-packages/optimum/gptq/quantizer.py", line 298, in _replace_by_quant_layers self._replace_by_quant_layers(child, names, name + "." + name1 if name != "" else name1) [Previous line repeated 1 more time] File "/home/user/.local/lib/python3.10/site-packages/optimum/gptq/quantizer.py", line 282, in _replace_by_quant_layers new_layer = QuantLinear( File "/home/user/.local/lib/python3.10/site-packages/auto_gptq/nn_modules/qlinear/qlinear_exllama.py", line 68, in __init__ assert outfeatures % 32 == 0 AssertionError

Container logs:

Fetching error logs...