Transformers
GGUF
falcon
falcon-40b
long-context
NTK-YaRN
text-generation-inference