Transformers
GGUF
llama
text-generation-inference
TheBloke's picture
GGUF model commit (made with llama.cpp commit a98b163)
3fcc4d4