THE model can not run

#1
by macma110 - opened

when I run this demo,print the error like this,how can I deal it?
RuntimeError: Error(s) in loading state_dict for GPT2LMHeadModel:
size mismatch for transformer.h.0.attn.bias: copying a param with shape torch.Size([1, 1, 2048, 2048]) from checkpoint, the shape in
current model is torch.Size([1, 1, 1024, 1024]).

Hi, could you please share your code you ran that caused this error? We have not seen any errors caused by using the 'quick-start' code from the model card

Sign up or log in to comment