Neko-Institute-of-Science
/

VicUnLocked-13b-LoRA

Model card Files Files and versions Community

VicUnLocked-13b-LoRA / README.md

Neko-Institute-of-Science's picture

Neko-Institute-of-Science

Add usage guide

f190715 about 1 year ago

|

raw history blame

1.54 kB

	---
	datasets:
	- gozfarb/ShareGPT_Vicuna_unfiltered
	---
	# Convert tools
	https://github.com/practicaldreamer/vicuna_to_alpaca

	# Training tool
	https://github.com/oobabooga/text-generation-webui

	ATM I'm using v4.3 of the dataset and training full context.

	This LoRA is already pretty functional but far from finished training. ETA from the start 200 hours.
	To use this LoRA please replace the config files to ones of Vicuna and I will have them here. Other than that use normal llama then replace the config files then load LoRA.

	checkpoint-9728-failed: This first test used the original format from the convert tool, but it was later found out this caused broken context. It would work as expected from the initial prompt but the moment you asked it a question about anything in the past it would say something random.
	I have since restarted training with the new format B from the tool and it seems to have fixed the issue with the original format. I will be uploading checkpoints everyday until it's finished or other issues are found.

	# How to test?
	1. Download LLaMA-13B-HF: https://huggingface.co/Neko-Institute-of-Science/LLaMA-13B-HF
	2. Replace special_tokens_map.json and tokenizer_config.json using the ones on this repo.
	3. Rename LLaMA-13B-HF to vicuna-13b
	4. Load ooba: ```python server.py --listen --model vicuna-13b --load-in-8bit --chat --lora checkpoint-xxxx```
	5. Instruct mode: Vicuna-v1 it will load Vicuna-v0 by defualt


	# Track Training?
	https://wandb.ai/neko-science/VicUnLocked?workspace=user-neko-science