runtime error
Exit code: 1. Reason: loading shards: 50%|█████ | 1/2 [00:09<00:09, 9.72s/it][A model-00002-of-00002.safetensors: 0%| | 0.00/3.64G [00:00<?, ?B/s][A model-00002-of-00002.safetensors: 3%|▎ | 115M/3.64G [00:01<00:33, 107MB/s][A model-00002-of-00002.safetensors: 13%|█▎ | 482M/3.64G [00:02<00:12, 254MB/s][A model-00002-of-00002.safetensors: 32%|███▏ | 1.17G/3.64G [00:03<00:05, 450MB/s][A model-00002-of-00002.safetensors: 45%|████▍ | 1.64G/3.64G [00:04<00:04, 429MB/s][A model-00002-of-00002.safetensors: 66%|██████▌ | 2.40G/3.64G [00:05<00:02, 541MB/s][A model-00002-of-00002.safetensors: 84%|████████▎ | 3.04G/3.64G [00:06<00:01, 573MB/s][A model-00002-of-00002.safetensors: 100%|█████████▉| 3.64G/3.64G [00:06<00:00, 522MB/s] Downloading shards: 100%|██████████| 2/2 [00:16<00:00, 8.23s/it][A Downloading shards: 100%|██████████| 2/2 [00:16<00:00, 8.46s/it] Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s][A Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 60787.01it/s] generation_config.json: 0%| | 0.00/215 [00:00<?, ?B/s][A generation_config.json: 100%|██████████| 215/215 [00:00<00:00, 1.45MB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 15, in <module> model = Gemma3ForConditionalGeneration.from_pretrained( File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 273, in _wrapper return func(*args, **kwargs) File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4531, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 501, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
Container logs:
Fetching error logs...