text-generation-inference error
#5 by msteele - opened
When running on TGI, I get this error:
RuntimeError: weight model.layers.0.self_attn.q_proj.weight does not exist
Does anyone else have this issue?
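One way to narrow this down is to list which tensor names the checkpoint actually contains for that layer. The sketch below assumes a sharded checkpoint with a `model.safetensors.index.json` file next to the shards; the helper names and the example map are made up for illustration. If the checkpoint is GPTQ-quantized (an assumption, not something confirmed here), the projection layers are usually stored as `qweight`, `qzeros`, `scales`, and `g_idx` rather than a plain `weight`, which would be one possible explanation for the missing name.

```python
import json


def load_weight_map(index_path):
    """Read the tensor-name -> shard-file map from a safetensors index file."""
    with open(index_path) as f:
        return json.load(f)["weight_map"]


def tensors_with_prefix(weight_map, prefix):
    """Return the sorted tensor names that start with the given prefix."""
    return sorted(name for name in weight_map if name.startswith(prefix))


# Hypothetical weight map, shaped like the "weight_map" section of an index
# file for a GPTQ-style checkpoint. The shard file name is invented.
example_map = {
    "model.layers.0.self_attn.q_proj.qweight": "model-00001.safetensors",
    "model.layers.0.self_attn.q_proj.qzeros": "model-00001.safetensors",
    "model.layers.0.self_attn.q_proj.scales": "model-00001.safetensors",
    "model.layers.0.self_attn.q_proj.g_idx": "model-00001.safetensors",
}

missing = "model.layers.0.self_attn.q_proj.weight"
found = tensors_with_prefix(example_map, "model.layers.0.self_attn.q_proj")

print("present:", found)
print("has plain weight:", missing in example_map)
```

With a real checkpoint, `load_weight_map("model.safetensors.index.json")` would replace `example_map`. If only quantized tensor names show up, the loader is probably looking for an unquantized layout.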
@msteele Same problem here. Did you figure it out? I can get the quantized 13B running, but not the 70B...
Fantastic, thanks!
Something is wrong with the current Dockerfile, and cargo chef keeps running out of memory (>64 GB) when I try to docker build.
Great, thanks for letting us know. Hopefully that PR will be merged soon, and then it will work with the TGI-provided Dockerfile.