Transformers
GGUF
llama
text-generation-inference
TheBloke's picture
Upload in splits of max 50GB due to HF 50GB limit. (made with llama.cpp commit 465219b)
a8a335e