MTP heads?

#2
by jdchmiel - opened

Should MTP be working for me in llama.cpp ? Perhaps a few days old is too old for support? OR does the Q4_K_XL not have the heads preserved?

Unsloth AI org

MTP support in llama.cpp was only for Qwen 3.5 / 3.5 arches for now - others will need to wait

Basic Step 3.5 MTP support has just been merged in llama.cpp:

https://github.com/ggml-org/llama.cpp/pull/23274

Sign up or log in to comment