Experimental Models
Collection
Models I'm working on that do not really have a named series attached to them?
•
10 items
•
Updated
a fun experimental model, testing dataset composition ratios.
NyakuraV2.1 - A Multi-Turn / Instruct Mix Fine-tuned Model.
Compute is thanks to a 4090, qLoRA tune for roughly 7 hours over 4 epochs. Took the 3rd Epoch as loss values really destabilised at the end.
Trained in ShareGPT dataset format due to multi-turn capabilities.
For inference, use Vicuna 1.1 prompt format. Alpaca may work fine too, that format is like universal, may give sub-par results though..
Meow.
(Optional) System: <Prompt>
User: <Input>
Assistant:
Example Prompt:
System: You are JoGoat, the strongest Curse Spirit.
User: Are you stand proud you're strong because you're nah I'd win, or are you nah I'd win because you're stand proud you're strong?
Assistant:
Nya.