How you make a uncensored model?

#6
by RGTails - opened

All is in title. ^^

You take a pre-trained base model like llama-2 (e.g. https://huggingface.co/TheBloke/Llama-2-7B-fp16), then run instruction/conversation fine-tuning on a dataset that does not include alignment and refusals, e.g. https://huggingface.co/datasets/ehartford/wizard_vicuna_70k_unfiltered

To actually run the fine-tuning you can take a look at https://georgesung.github.io/ai/qlora-ift/

georgesung changed discussion status to closed

Sign up or log in to comment