runtime error

/2.12M [00:00<00:00, 45.1MB/s] added_tokens.json: 0%| | 0.00/1.13k [00:00<?, ?B/s] added_tokens.json: 100%|██████████| 1.13k/1.13k [00:00<00:00, 5.63MB/s] special_tokens_map.json: 0%| | 0.00/470 [00:00<?, ?B/s] special_tokens_map.json: 100%|██████████| 470/470 [00:00<00:00, 3.36MB/s] Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. config.json: 0%| | 0.00/816 [00:00<?, ?B/s] config.json: 100%|██████████| 816/816 [00:00<00:00, 5.64MB/s] configuration_phi.py: 0%| | 0.00/2.21k [00:00<?, ?B/s] configuration_phi.py: 100%|██████████| 2.21k/2.21k [00:00<00:00, 12.7MB/s] A new version of the following files was downloaded from https://huggingface.co/mlabonne/phixtral-4x2_8: - configuration_phi.py . Make sure to double-check they do not contain any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. modeling_phi.py: 0%| | 0.00/34.4k [00:00<?, ?B/s] modeling_phi.py: 100%|██████████| 34.4k/34.4k [00:00<00:00, 116MB/s] A new version of the following files was downloaded from https://huggingface.co/mlabonne/phixtral-4x2_8: - modeling_phi.py . Make sure to double-check they do not contain any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. Traceback (most recent call last): File "/home/user/app/app.py", line 14, in <module> model = AutoModelForCausalLM.from_pretrained( File "/home/user/.local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 561, in from_pretrained return model_class.from_pretrained( File "/home/user/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2897, in from_pretrained raise RuntimeError("No GPU found. A GPU is needed for quantization.") RuntimeError: No GPU found. A GPU is needed for quantization.

Container logs:

Fetching error logs...