---
license: apache-2.0
---

This is an SFT model built on a mix of public datasets. A DPO stage with custom data is being set up. This is a finetune of Mistral. It should exhibit a broad base of instruction tuning and some other fun roleplaying capabilities. It is still being trained, and this is about 50% done.