airoboros-7b / README.md
jondurbin's picture
Update README.md
c6851de
|
raw
history blame
987 Bytes
metadata
license: other

Overview

This is a fine-tuned 7b parameter LlaMa model, fine tuned on nearly 100k synthetic instructions generated by my tool airobors

I used a jailbreak prompt to generate the synthetic instructions this time, which resulted in some questionable training data, such as synthesizing drugs, making homemade flamethrowers, etc. Mind you, this is all generated by ChatGPT, not me, so I won't speak for any outputs the model produces.

Training data

I'm still combing through the data a bit to make sure there's nothing blatantly illegal, but I'll publish it soon.

The jailbreak prompt I used is the default prompt in the python code when using the --uncensored flag: (https://github.com/jondurbin/airoboros/blob/main/airoboros/self_instruct.py#L39)

License

The model is licensed under the LLaMA model, and the dataset is licensed under the terms of OpenAI because it uses ChatGPT. Everything else is free.