metadata
license: apache-2.0
This is an SFT model built on a mix of public datasets. DPO with custom data is being set up next.
This is a finetune of Mistral. It should exhibit a broad base of instruction tuning and some other fun roleplaying capabilities.
It's still being trained and is about 50% done.