ggml versions of OpenLLaMa 7B v2
For use with llama.cpp.
- Version: version 2 final 1T tokens
- Project: OpenLLaMA: An Open Reproduction of LLaMA
- Model: openlm-research/open_llama_7b_v2
- llama.cpp 4,5,8-bit quantization: build 567(2d5db48) or later
- llama.cpp newer quantization formats: build 616(99009e7) or later
Perplexity
Coming soon...
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no library tag.