GPU resources

#3
by shon711 - opened

Hi,
Thanks for sharing this!
What is the minimum GPU hardware requirements for running this?

Thanks!

It's the same as the standard Llama-7b. The answer depends on the precision in which you load you weights. If you load it in 4bit, which is how it was trained, that's about 4GB to load the model. You need some additional memory to do inference, so let's say at least 6GB as a minimum requirement.

Thanks! Does loading in 8bit increases the accuracy?

I haven't done a systematic analysis of the performance with 4bit and 8bit. Typically the performance increases with higher bits, but since this model was trained with 4bit precision, that may not be true in this case.

Sign up or log in to comment