Any plans to train the MTP draft model as well?

#4
by dbm000 - opened

My initial testing of this finetune is really impressive, it is a lot more diverse. Unfortunately because it's choosing different tokens, MTP draft acceptance rate tanks, which of course is expected.

Any plans to finetune the draft model as well?

Sign up or log in to comment