Spaces

bconsolvo
/

Intel-neural-chat-7b-v3-2

Runtime error

App Files Files Community

runtime error

h_model-00002-of-00002.bin: 69%|██████▉ | 3.14G/4.54G [00:19<00:06, 207MB/s][A pytorch_model-00002-of-00002.bin: 75%|███████▌ | 3.41G/4.54G [00:24<00:08, 128MB/s][A pytorch_model-00002-of-00002.bin: 84%|████████▎ | 3.80G/4.54G [00:25<00:04, 161MB/s][A pytorch_model-00002-of-00002.bin: 92%|█████████▏| 4.19G/4.54G [00:26<00:01, 200MB/s][A pytorch_model-00002-of-00002.bin: 100%|█████████▉| 4.54G/4.54G [00:27<00:00, 167MB/s] Downloading shards: 100%|██████████| 2/2 [01:39<00:00, 45.69s/it][A Downloading shards: 100%|██████████| 2/2 [01:39<00:00, 49.57s/it] Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s][A Loading checkpoint shards: 50%|█████ | 1/2 [02:22<02:22, 142.66s/it][A Loading checkpoint shards: 100%|██████████| 2/2 [03:32<00:00, 99.64s/it] [A Loading checkpoint shards: 100%|██████████| 2/2 [03:32<00:00, 106.09s/it] generation_config.json: 0%| | 0.00/111 [00:00<?, ?B/s][A generation_config.json: 100%|██████████| 111/111 [00:00<00:00, 54.6kB/s] Traceback (most recent call last): File "/home/user/app/app.py", line 19, in <module> model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16, device_map="auto", offload_folder="offload", offload_state_dict = True) File "/home/user/.local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 566, in from_pretrained return model_class.from_pretrained( File "/home/user/.local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3773, in from_pretrained dispatch_model(model, **device_map_kwargs) File "/home/user/.local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 438, in dispatch_model raise ValueError( ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.

Container logs:

Fetching error logs...