runtime error

Downloading shards: 35%|███▌ | 6/17 [00:29<01:05, 5.91s/it]
Downloading shards: 41%|████ | 7/17 [00:37<01:04, 6.42s/it]
Downloading shards: 47%|████▋ | 8/17 [00:40<00:49, 5.49s/it]
Downloading shards: 53%|█████▎ | 9/17 [00:47<00:48, 6.05s/it]
Downloading shards: 59%|█████▉ | 10/17 [00:52<00:38, 5.46s/it]
Downloading shards: 65%|██████▍ | 11/17 [01:00<00:37, 6.28s/it]
Downloading shards: 71%|███████ | 12/17 [01:04<00:27, 5.58s/it]
Downloading shards: 76%|███████▋ | 13/17 [01:12<00:25, 6.41s/it]
Downloading shards: 82%|████████▏ | 14/17 [01:16<00:16, 5.64s/it]
Downloading shards: 88%|████████▊ | 15/17 [01:27<00:14, 7.39s/it]
Downloading shards: 94%|█████████▍| 16/17 [01:29<00:05, 5.62s/it]
Downloading shards: 100%|██████████| 17/17 [01:38<00:00, 6.54s/it]
Downloading shards: 100%|██████████| 17/17 [01:38<00:00, 5.77s/it]
Loading checkpoint shards: 0%| | 0/17 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 17/17 [00:00<00:00, 432140.41it/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 48, in <module>
    model = AutoModelForCausalLM.from_pretrained("SageLiao/llama3-LlamaFactory-demo-v3", device_map="auto")
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 563, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3735, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 490, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
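
The ValueError above appears to be raised when the device map resolved from device_map="auto" assigns every module to "disk", which usually means no GPU was visible and CPU memory was too small for any layer. The following is a minimal sketch of that condition in plain Python, with no dependency on accelerate. The helper name `is_whole_model_on_disk` and the sample device maps are hypothetical and only illustrate the check; they are not taken from the library or from app.py.

```python
def is_whole_model_on_disk(device_map):
    """Return True when every module in device_map is assigned to "disk".

    device_map maps module names to a target: a GPU index, "cpu", or "disk".
    This mirrors the situation the error message describes; it is an
    illustrative check, not accelerate's actual implementation.
    """
    return set(device_map.values()) == {"disk"}


# Hypothetical device maps for illustration only.
all_disk = {"": "disk"}  # the whole model sent to disk
mixed = {"model.embed_tokens": "cpu", "model.layers": "disk", "lm_head": "disk"}

for name, dm in [("all_disk", all_disk), ("mixed", mixed)]:
    status = "would fail to dispatch" if is_whole_model_on_disk(dm) else "can be dispatched"
    print(f"{name}: {status}")
```

If the first case applies, likely remedies are running on hardware with a GPU or more RAM, loading a smaller or quantized checkpoint, or following the message's suggestion to use `disk_offload`.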
