Text Generation
Transformers
English
mpt
llm-rs
ggml
text-generation-inference

Support the 8k base, instruct and chat models

#5
by ifire - opened
rustformers org

I'll take a look later and check if it's compatible without any architectural changes. Then i will upload it.

Thanks for taking a look

rustformers org

If the ggml alibi operation supports different context lengths this should work similar to the 30B models.

Sign up or log in to comment