Transformers
GGUF
llama
text-generation-inference
TheBloke's picture
Initial GGUF model commit (model made with llama.cpp commit d59bd97)
03e06c2