Which llama.cpp version?

#2
by MLDataScientist - opened

Hi,

Could you say which version of llama.cpp (or which fork) supports this model, and share a GitHub link? I want to make sure mainline llama.cpp or a fork supports it before I download one of the quants. Thanks!

It runs in the original llama.cpp. No forks or anything extra required.

The Kaitchup org • edited 1 day ago

All the M3 GGUFs (not only the MoQ GGUFs) need a special version of llama.cpp for now. I didn't check whether the PR has been merged yet, but this is what I did:

git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
git fetch origin pull/24523/head:minimax-m3
git checkout minimax-m3
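
After checking out the branch, the build step should be the usual llama.cpp CMake build. Here is a minimal sketch, assuming CMake and a C/C++ toolchain are installed and that the PR branch builds the same way as mainline. The helper name `build_llama_cpp` is hypothetical and only wraps the standard CMake commands:

```shell
# Hypothetical helper: build llama.cpp from an already checked-out source tree.
# Assumes cmake and a C/C++ compiler are on PATH.
build_llama_cpp() {
  src_dir="${1:-llama.cpp}"

  # Refuse to run if the directory does not look like a CMake project.
  if [ ! -f "${src_dir}/CMakeLists.txt" ]; then
    echo "no CMakeLists.txt found in ${src_dir}" >&2
    return 1
  fi

  # Configure into <src_dir>/build, then compile in Release mode.
  cmake -S "${src_dir}" -B "${src_dir}/build" &&
    cmake --build "${src_dir}/build" --config Release
}
```

Run it as `build_llama_cpp llama.cpp` from the directory that contains the clone. The binaries (for example `llama-cli`) should then be under `llama.cpp/build/bin`. GPU backends need extra CMake options, which are not covered here.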
