---
datasets:
- gozfarb/ShareGPT_Vicuna_unfiltered
---
# Conversion tool
https://github.com/practicaldreamer/vicuna_to_alpaca

# Training tool
https://github.com/oobabooga/text-generation-webui

At the moment I'm using v4.3 of the dataset and training at full context.

This LoRA is already fairly functional but far from finished training; the estimated total training time is 200 hours.
To use this LoRA, replace the base model's config files with Vicuna's, which I will host in this repo. In other words: load a normal LLaMA model, replace its config files, then load the LoRA (a loading sketch follows).
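
For reference, loading the base model plus this LoRA in plain `transformers` looks roughly like the following. This is a minimal sketch, assuming `transformers`, `peft`, and `bitsandbytes` are installed; the folder names follow the steps below, and `checkpoint-xxxx` stays a placeholder for whichever checkpoint you download.

```python
# Minimal sketch: load the renamed base model in 8-bit, then attach the LoRA.
from transformers import LlamaForCausalLM, LlamaTokenizer
from peft import PeftModel

base = LlamaForCausalLM.from_pretrained(
    "vicuna-13b",        # LLaMA-13B-HF with the Vicuna tokenizer configs swapped in
    load_in_8bit=True,   # mirrors ooba's --load-in-8bit flag
    device_map="auto",
)
tokenizer = LlamaTokenizer.from_pretrained("vicuna-13b")
model = PeftModel.from_pretrained(base, "checkpoint-xxxx")  # the LoRA checkpoint folder
```

After that, `model.generate` works as with any transformers causal LM.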

**checkpoint-9728-failed**: This first test used the original format from the conversion tool, but that format turned out to break context: the model would respond as expected to the initial prompt, but the moment you asked about anything earlier in the conversation it would say something random.
I have since restarted training with the tool's new format B, which appears to have fixed the issue. I will be uploading checkpoints every day until training finishes or other issues are found.

# How to test?
1. Download LLaMA-13B-HF: https://huggingface.co/Neko-Institute-of-Science/LLaMA-13B-HF
2. Replace special_tokens_map.json and tokenizer_config.json with the ones in this repo (see the sketch after this list).
3. Rename LLaMA-13B-HF to vicuna-13b
4. Load ooba: ```python server.py --listen --model vicuna-13b --load-in-8bit --chat --lora checkpoint-xxxx```
5. Instruct mode: select Vicuna-v1 (it loads Vicuna-v0 by default).
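
Steps 2 and 3 can be scripted. A minimal sketch, assuming this repo's tokenizer files sit in a local folder (the `vicuna-lora-repo` path is a made-up placeholder):

```python
# Sketch of steps 2-3: overwrite the tokenizer configs, then rename the model folder.
import shutil
from pathlib import Path

base = Path("LLaMA-13B-HF")
repo = Path("vicuna-lora-repo")  # hypothetical local copy of this repo's files
for name in ("special_tokens_map.json", "tokenizer_config.json"):
    shutil.copy(repo / name, base / name)  # step 2: replace the tokenizer configs
base.rename("vicuna-13b")                  # step 3: rename so ooba finds it as vicuna-13b
```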


# Track Training?
https://wandb.ai/neko-science/VicUnLocked?workspace=user-neko-science