license: apache-2.0 | |
This is SFT built on a mix of public datasets. Setting up for DPO with custom data. | |
This is a finetune of Mistrial. It should exhibit a broad base of instuction tuning and some other fun roleplaying capablities. | |
Its being trained this is about 50% done. |