datasets:
- gozfarb/ShareGPT_Vicuna_unfiltered
TEST FINISHED FOR NOW. I HAVE MOVED ON TO 30B TRAINING.
Conversion tool
https://github.com/practicaldreamer/vicuna_to_alpaca
Training tool
https://github.com/oobabooga/text-generation-webui
At the moment I'm using v4.3 of the dataset and training at full context.
This LoRA already feels fully functional. To use it, replace the config files with the Vicuna ones, which I will keep in this repo. Other than that, use normal LLaMA, replace the config files, then load the LoRA.
checkpoint-9728-failed: This first test used the original format from the convert tool, which was later found to break context. The model would respond as expected to the initial prompt, but the moment you asked it about anything earlier in the conversation it would say something random. I have since restarted training with the new format B from the tool, and that seems to have fixed the issue.
How to test?
- Download LLaMA-13B-HF: https://huggingface.co/Neko-Institute-of-Science/LLaMA-13B-HF
- Replace special_tokens_map.json and tokenizer_config.json with the ones in this repo.
- Rename LLaMA-13B-HF to vicuna-13b
- Load ooba:
python server.py --listen --model vicuna-13b --load-in-8bit --chat --lora checkpoint-xxxx
- Instruct mode: select Vicuna-v1; it will load Vicuna-v0 by default.
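
The file replacement and rename steps above can be scripted. The following is a minimal sketch, not part of either tool: prepare_vicuna_dir is a hypothetical helper name, and the two directory arguments are assumptions about where you downloaded LLaMA-13B-HF and where you cloned this repo. Only the two file names and the vicuna-13b target name come from the steps above.

```python
import shutil
from pathlib import Path

# The two file names are taken from the steps above.
TOKENIZER_FILES = ("special_tokens_map.json", "tokenizer_config.json")


def prepare_vicuna_dir(llama_dir, config_dir, target_name="vicuna-13b"):
    """Hypothetical helper: overwrite the tokenizer configs, then rename the folder.

    llama_dir  -- assumed path to the downloaded LLaMA-13B-HF folder
    config_dir -- assumed path to a local copy of this repo's config files
    """
    llama_dir = Path(llama_dir)
    config_dir = Path(config_dir)
    target = llama_dir.parent / target_name

    if not llama_dir.is_dir():
        raise FileNotFoundError(f"model folder not found: {llama_dir}")
    if target.exists():
        raise FileExistsError(f"target already exists: {target}")

    # Check every replacement file up front so nothing is half-applied.
    missing = [n for n in TOKENIZER_FILES if not (config_dir / n).is_file()]
    if missing:
        raise FileNotFoundError(f"missing replacement files: {missing}")

    # Overwrite the stock LLaMA tokenizer configs with the replacements.
    for name in TOKENIZER_FILES:
        shutil.copyfile(config_dir / name, llama_dir / name)

    # Rename the folder so the webui can load it as --model vicuna-13b.
    llama_dir.rename(target)
    return target
```

For example, prepare_vicuna_dir("models/LLaMA-13B-HF", "path/to/this/repo") would leave a models/vicuna-13b folder, after which the server.py command above can be run as written. Both paths here are placeholders.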