Is this DFlash model still under training ?

#9
by sdd5125 - opened

Thank you for your hard work!
Like many other users, I tested the DFlash speculative decoding to find that it underperformed compared to a MTP 3 from the model (in my case AWQ 4bit). Is it planned to or is this model still under training (just like the one destined for Qwen3.6 27B) ?

Sign up or log in to comment