Use chatML. Initial finetune pass with general data.
Not specialized for RP and no RL tuning so its a bit unfocused.