My initial testing of this finetune is really impressive; the output is a lot more diverse. Unfortunately, because it's choosing different tokens than the base model, the MTP draft acceptance rate tanks, which is of course expected.
Any plans to finetune the draft model as well?
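
For context, here is a rough sketch of why the acceptance rate drops, assuming simple greedy verification where a drafted token is kept only if it matches what the main model would have picked. The function names and token IDs are made up for illustration; this is not the actual logic of any particular inference engine.

```python
def accepted_prefix_len(draft_tokens, target_tokens):
    """Count how many drafted tokens match the main model before the first mismatch."""
    n = 0
    for d, t in zip(draft_tokens, target_tokens):
        if d != t:
            break  # everything after the first mismatch is discarded
        n += 1
    return n


def acceptance_rate(steps):
    """steps: list of (draft_tokens, target_tokens) pairs, one per speculative step."""
    proposed = sum(len(d) for d, _ in steps)
    accepted = sum(accepted_prefix_len(d, t) for d, t in steps)
    return accepted / proposed if proposed else 0.0


# Hypothetical token IDs: the draft head still predicts what the base model
# would say, but the finetuned main model now diverges earlier in each step.
base_steps = [([1, 2, 3], [1, 2, 3]), ([4, 5, 6], [4, 5, 7])]
finetuned_steps = [([1, 2, 3], [1, 8, 3]), ([4, 5, 6], [9, 5, 6])]

print("base:", acceptance_rate(base_steps))
print("finetuned:", acceptance_rate(finetuned_steps))
```

The draft head was trained against the original model's distribution, so every token where the finetune diverges cuts the accepted prefix short.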