The base model is meta's Llama-2-7b-chat-hf. It was finetuned using SFT and the Anthropic/hh-rlhf dataset and the model prompt is similar to the original Guanaco model. This repo contains the merged fp16 model.
Legal Disclaimer: This model is bound by the usage restrictions of the original Llama-2 model. And comes with no warranty or gurantees of any kind.
- license:
- llama2
- llama2
- datasets:
- Anthropic/hh-rlhf
- Anthropic/hh-rlhf
- language:
- en
- en
- reference: https://gist.github.com/younesbelkada/9f7f75c94bdc1981c8ca5cc937d4a4da