anon8231489123's picture
added ggml quantization for cuda model
4ef20dd