text-generation-inference error

#5
by msteele - opened

when running on TGI I get error:
RuntimeError: weight model.layers.0.self_attn.q_proj.weight does not exist
anyone else have this issue?

@msteele same problem, figure it out? i can get the quantized 13b running, but not 70...

Fantastic, thanks!

@TheBloke @fxmarty @msteele Finally got it to run, ended up having to use a older text-gen-inference server dockerfile combined with fxmarty's changes and the updated transformers version in the server requirements.txt.

Something is wrong with the current dockerfile and cargo chef keeps running me out of memory (>64gb) when trying to docker build

Great, thanks for letting us know. Hopefully that PR will be merged soon and then it will work with the TGI provided Dockerfile

This comment has been hidden

Sign up or log in to comment