KeyError: 'transformer.h.10.attn.c_attn.SCB' to run the model on vllm
#1
by
aalinazar
- opened
I tried to run this model with vllm.
From the command line:
vllm serve "brainiac-origin/jais-chat-30b-8bit"
But I got error below:
Traceback (most recent call last):100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 4.94G/4.94G [07:17<00:00, 11.7MB/s]
File "/usr/local/lib/python3.10/multiprocessing/process.py", line 314, in _bootstrapโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ | 4.82G/4.94G [07:17<00:10, 10.8MB/s]
self.run()-00007.safetensors: 100%|โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ| 4.94G/4.94G [07:27<00:00, 10.6MB/s]
File "/usr/local/lib/python3.10/multiprocessing/process.py", line 108, in run
self._target(*self._args, **self._kwargs)
File "/usr/local/lib/python3.10/site-packages/vllm/entrypoints/openai/rpc/server.py", line 230, in run_rpc_server
server = AsyncEngineRPCServer(async_engine_args, usage_context, rpc_path)
File "/usr/local/lib/python3.10/site-packages/vllm/entrypoints/openai/rpc/server.py", line 31, in init
self.engine = AsyncLLMEngine.from_engine_args(
File "/usr/local/lib/python3.10/site-packages/vllm/engine/async_llm_engine.py", line 740, in from_engine_args
engine = cls(
File "/usr/local/lib/python3.10/site-packages/vllm/engine/async_llm_engine.py", line 636, in init
self.engine = self._init_engine(*args, **kwargs)
File "/usr/local/lib/python3.10/site-packages/vllm/engine/async_llm_engine.py", line 840, in _init_engine
return engine_class(*args, **kwargs)
File "/usr/local/lib/python3.10/site-packages/vllm/engine/async_llm_engine.py", line 272, in init
super().init(*args, **kwargs)
File "/usr/local/lib/python3.10/site-packages/vllm/engine/llm_engine.py", line 270, in init
self.model_executor = executor_class(
File "/usr/local/lib/python3.10/site-packages/vllm/executor/executor_base.py", line 46, in init
self._init_executor()
File "/usr/local/lib/python3.10/site-packages/vllm/executor/gpu_executor.py", line 39, in _init_executor
self.driver_worker.load_model()
File "/usr/local/lib/python3.10/site-packages/vllm/worker/worker.py", line 182, in load_model
self.model_runner.load_model()
File "/usr/local/lib/python3.10/site-packages/vllm/worker/model_runner.py", line 881, in load_model
self.model = get_model(model_config=self.model_config,
File "/usr/local/lib/python3.10/site-packages/vllm/model_executor/model_loader/init.py", line 19, in get_model
return loader.load_model(model_config=model_config,
File "/usr/local/lib/python3.10/site-packages/vllm/model_executor/model_loader/loader.py", line 344, in load_model
model.load_weights(
File "/usr/local/lib/python3.10/site-packages/vllm/model_executor/models/jais.py", line 366, in load_weights
param = params_dict[name]
KeyError: 'transformer.h.10.attn.c_attn.SCB'
Loading safetensors checkpoint shards: 0% Completed | 0/7 [00:00<?, ?it/s]
I'm using vllm==0.5.5+cu118
Anyone can help to solve this issue?