Here are a few GGUF (v2) quantizations of the model conceptofmind/Open-LLongMA-3b,
which is based on openlm-research/open_llama_3b.
Open LLongMA 3B is a language model trained for a context size of 8192 tokens using linear RoPE scaling (rope_scaling) with a factor of 0.25. Set the scale to 0.25 at inference time: with a scale of 1.0 (no scaling) the model outputs gibberish.
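
To illustrate why the 0.25 scale matters, here is a minimal sketch of linear RoPE scaling in plain Python. It is not the code used by any particular runtime. The function name `rope_angles`, the head dimension of 100, the base of 10000, and the 2048-token context of the base model are assumptions made for the example, not values stated in this card.

```python
def rope_angles(position, head_dim=100, freq_scale=1.0, base=10000.0):
    """Rotation angles for one token position under linear RoPE scaling.

    Linear scaling multiplies the position index by freq_scale before the
    usual rotary frequencies are applied. All defaults are illustrative.
    """
    scaled_position = position * freq_scale
    return [
        scaled_position * base ** (-2.0 * i / head_dim)
        for i in range(head_dim // 2)
    ]


BASE_CONTEXT = 2048      # assumed context size of the base model
EXTENDED_CONTEXT = 8192  # context size stated for Open LLongMA 3B
FREQ_SCALE = 0.25        # linear scale stated for Open LLongMA 3B

# With the 0.25 scale, the last of the 8192 positions is compressed into the
# position range the base model is assumed to have been trained on.
last_position = EXTENDED_CONTEXT - 1
print(last_position * FREQ_SCALE)                # 2047.75
print(last_position * FREQ_SCALE < BASE_CONTEXT)  # True

# Token 4 at scale 0.25 gets the same angles as token 1 at scale 1.0, so
# running at scale 1.0 hands the model positions it was not fine-tuned on.
print(rope_angles(4, freq_scale=FREQ_SCALE) == rope_angles(1, freq_scale=1.0))  # True
```

In practice the scale is set through whatever option your inference runtime exposes for the linear RoPE frequency scale, together with a context size of 8192.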