runtime error

Downloading shards: 0%| | 0/8 [00:00<?, ?it/s]
Downloading shards: 12%|█▎ | 1/8 [00:35<04:07, 35.38s/it]
Downloading shards: 25%|██▌ | 2/8 [01:00<02:57, 29.59s/it]
Downloading shards: 38%|███▊ | 3/8 [01:23<02:13, 26.61s/it]
Downloading shards: 50%|█████ | 4/8 [01:55<01:53, 28.45s/it]
Downloading shards: 62%|██████▎ | 5/8 [02:22<01:24, 28.10s/it]
Downloading shards: 75%|███████▌ | 6/8 [18:12<11:23, 341.57s/it]
Downloading shards: 88%|████████▊ | 7/8 [18:26<03:54, 234.46s/it]
Downloading shards: 100%|██████████| 8/8 [18:36<00:00, 162.86s/it]
Downloading shards: 100%|██████████| 8/8 [18:36<00:00, 139.54s/it]
Loading checkpoint shards: 0%| | 0/8 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 8/8 [00:00<00:00, 113359.57it/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 24, in <module>
    model = AutoModelForCausalLM.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 563, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3735, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 490, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
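
The traceback shows `dispatch_model` rejecting a device map that places every module on disk. The log does not show the arguments passed at `app.py` line 24, so the following is only a minimal sketch under the assumption that the call used `device_map="auto"` on a container with no GPU and too little free RAM. `build_load_kwargs` is a hypothetical helper, not part of transformers or accelerate; it only chooses a `device_map` value to pass to `from_pretrained`.

```python
from typing import Any, Dict


def build_load_kwargs(has_cuda: bool) -> Dict[str, Any]:
    """Hypothetical helper: pick from_pretrained keyword arguments by hardware."""
    if has_cuda:
        # A GPU is present: let accelerate infer the placement of each module.
        return {"device_map": "auto"}
    # No GPU (assumed cause here): pin every module to the CPU so the
    # device map can never resolve to disk for the whole model.
    return {"device_map": {"": "cpu"}}


if __name__ == "__main__":
    # On a CPU-only container this prints the CPU-pinned map.
    print(build_load_kwargs(has_cuda=False))
```

In the app this might be used as `AutoModelForCausalLM.from_pretrained(model_id, **build_load_kwargs(torch.cuda.is_available()))`, where `model_id` stands for whatever checkpoint name the app already passes. Pinning to the CPU only helps if the model fits in RAM; if it does not, the load will likely fail with an out-of-memory error, and a smaller checkpoint or larger hardware would be needed.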
