Where is the gguf?

#1
by OracleToes - opened

The Qwen3.6-27B-PRISM-PRO-DQ page references this repo, saying to load the gguf of the eagle-3 drafter, but I don't see any gguf to download. Additionally there is no one-shot llama.cpp patch, there is only a patch for sglang

# 3. EAGLE-3 chain (needs the WIP PR #18039 patches + the RS-rollback fix --
#    a one-shot llama.cpp patch script is documented alongside the drafter:
#    https://huggingface.co/Ex0bit/Qwen3.6-27B-PRISM-EAGLE3)
./llama-server --model Qwen3.6-27B-PRISM-PRO-DQ.gguf \
    --spec-type draft-eagle3 --model-draft <eagle3-drafter.gguf> \
    --spec-draft-n-max 2

I can upload one but llama.cpp does not support eagle-3 yet - you can easily convert the tensors to GGUF using llama.cpp convert

Sign up or log in to comment