GGUF Inference API

With the utilization of the llama-cpp-python package, we are excited to introduce the GGUF model hosted in the Hugging Face Docker Spaces, made accessible through an OpenAI-compatible API. This space includes comprehensive API documentation to facilitate seamless integration.