runtime error

in any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. cpp_kernels.py: 0%| | 0.00/1.92k [00:00<?, ?B/s] cpp_kernels.py: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 1.92k/1.92k [00:00<00:00, 15.8MB/s] A new version of the following files was downloaded from https://huggingface.co/xun/Qwen-Audio-Chat-Int4: - cpp_kernels.py . Make sure to double-check they do not contain any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. A new version of the following files was downloaded from https://huggingface.co/xun/Qwen-Audio-Chat-Int4: - modeling_qwen.py - qwen_generation_utils.py - cpp_kernels.py . Make sure to double-check they do not contain any added malicious code. To avoid downloading new versions of the code file, you can pin a revision. mel_filters.npz: 0%| | 0.00/2.05k [00:00<?, ?B/s] mel_filters.npz: 100%|β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆ| 2.05k/2.05k [00:00<00:00, 11.8MB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 268, in <module> main() File "/home/user/app/app.py", line 262, in main model, tokenizer = _load_model_tokenizer(args) File "/home/user/app/app.py", line 60, in _load_model_tokenizer model = AutoModelForCausalLM.from_pretrained( File "/home/user/.local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 511, in from_pretrained return model_class.from_pretrained( File "/home/user/.cache/huggingface/modules/transformers_modules/xun/Qwen-Audio-Chat-Int4/8bf83e9b7d84973d48149ada14d0b8eb4843ad5b/modeling_qwen.py", line 1037, in from_pretrained return super().from_pretrained(pretrained_model_name_or_path, *model_args, config=config, cache_dir=cache_dir, File "/home/user/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 2489, in from_pretrained raise RuntimeError("GPU is required to quantize or run quantize model.") RuntimeError: GPU is required to quantize or run quantize model.

Container logs:

Fetching error logs...