runtime error
Traceback (most recent call last):
  File "/home/user/app/app.py", line 8, in <module>
    model = AutoModelForCausalLM.from_pretrained(model_name)
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 563, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3192, in from_pretrained
    config.quantization_config = AutoHfQuantizer.merge_quantization_configs(
  File "/usr/local/lib/python3.10/site-packages/transformers/quantizers/auto.py", line 157, in merge_quantization_configs
    quantization_config = AutoQuantizationConfig.from_dict(quantization_config)
  File "/usr/local/lib/python3.10/site-packages/transformers/quantizers/auto.py", line 87, in from_dict
    return target_cls.from_dict(quantization_config_dict)
  File "/usr/local/lib/python3.10/site-packages/transformers/utils/quantization_config.py", line 96, in from_dict
    config = cls(**config_dict)
  File "/usr/local/lib/python3.10/site-packages/transformers/utils/quantization_config.py", line 804, in __init__
    self.post_init()
  File "/usr/local/lib/python3.10/site-packages/transformers/utils/quantization_config.py", line 811, in post_init
    raise ValueError("AWQ is only available on GPU")
ValueError: AWQ is only available on GPU
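
The traceback shows the AWQ quantization config rejecting a host with no GPU while `from_pretrained` is parsing the checkpoint's `quantization_config`. A pre-flight check along the following lines might surface the problem with a clearer message before the load is attempted. This is only a sketch: the helper names (`is_awq_checkpoint`, `cuda_available`, `check_loadable`) are hypothetical and not part of transformers, and it assumes the checkpoint's config has already been read into a plain dict with a `quantization_config.quant_method` entry.

```python
import importlib.util


def is_awq_checkpoint(config: dict) -> bool:
    """Return True if a model config dict declares AWQ quantization.

    Assumes the usual layout: {"quantization_config": {"quant_method": "awq"}}.
    """
    quant = config.get("quantization_config") or {}
    return str(quant.get("quant_method", "")).lower() == "awq"


def cuda_available() -> bool:
    """Report whether PyTorch sees a CUDA device; False if torch is not installed."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch

    return torch.cuda.is_available()


def check_loadable(config: dict, has_gpu: bool) -> None:
    """Raise early, with an actionable message, for AWQ checkpoints on CPU-only hosts."""
    if is_awq_checkpoint(config) and not has_gpu:
        raise RuntimeError(
            "This checkpoint is AWQ-quantized and no GPU was detected. "
            "Run on GPU hardware or pick a non-AWQ variant of the model."
        )
```

In `app.py` this could be called as `check_loadable(config_dict, cuda_available())` just before line 8, where `config_dict` is the checkpoint's parsed `config.json`.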