Same with QAT?

#9
by Wladastic - opened

Is it possible to achieve the same with the QAT version of it?

If going even further was a thing, I wish someone did surgeon some layers to be non-transformers like qwen3.6 27b does.
Would be super interesting to have a Gemma4 with the outside-the-box thinking that Qwens have

Sign up or log in to comment