
Which inference repo is this quantized for?

#2
by xhyi - opened

Is this quantized for the current starcoder.cpp, or for upstream ggml, or something else?

I know nothing about starcoder.cpp. Could you provide a link, please? This is quantized for upstream ggml.

Thank you. For now it should work for both.
