It would be nice to provide quantized versions, like those by https://huggingface.co/TheBloke

Preferably GPTQ. Thanks.

Hi @tigerinus ,

Don't the quantized models provided by TheBloke work for you?

Wait... Are you saying TheBloke provides quantized versions of the Yi models?

