GPTQ-for-LLAMA error, RuntimeError: Error(s) in loading state_dict for LlamaForCausalLM: Missing key(s) in state_dict: "model.embed_tokens.weight",
#1 opened by KongfuAi
There are two options for using this model in FastChat:
- Try one of the models from the other branches
- or install the GPTQ-for-LLaMa CUDA branch in FastChat (there are instructions in the FastChat GPTQ documentation)
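For the second option, the steps look roughly like the following. This is a sketch based on FastChat's GPTQ guide (`docs/gptq.md` in the FastChat repository); the repository URLs, the `cuda` branch name, and the `repositories/` directory layout are assumptions taken from that guide and may change upstream.

```shell
# Install FastChat itself
git clone https://github.com/lm-sys/FastChat.git
cd FastChat
pip3 install -e .

# FastChat expects GPTQ-for-LLaMa under repositories/
# (clone the CUDA branch, then build the CUDA kernel)
mkdir -p repositories
cd repositories
git clone -b cuda https://github.com/oobabooga/GPTQ-for-LLaMa.git
cd GPTQ-for-LLaMa
python3 setup_cuda.py install
```

After that, a 4-bit model can be served with flags along the lines of `python3 -m fastchat.serve.cli --model-path <model-dir> --gptq-wbits 4 --gptq-groupsize 128` (check the FastChat GPTQ docs for the exact invocation for your model).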