GGUF is required :)

#11
by flymonk - opened

Could you support the GGUF format?
Thank you very much.

Yes, I second this, we need GGUF.

Looking for the GGUF too.

In the meantime, how could I test it? With Ollama?

Read the Ollama docs on how to create a new model.
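For reference, creating a model in Ollama comes down to pointing a Modelfile at a local GGUF file. A minimal sketch (the file path and parameter here are hypothetical placeholders, and per the comment below this particular model won't work until llama.cpp/Ollama add support):

```
# Modelfile — FROM points at a local GGUF file (hypothetical path)
FROM ./c4ai-command-r-v01.Q4_K_M.gguf

# optional: set a sampling default
PARAMETER temperature 0.7
```

Then build and run it with `ollama create command-r -f Modelfile` followed by `ollama run command-r`.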

I imagine that @TheBloke is already firing up the stoves.

Apparently, it won't work with Ollama right now (from their Discord).

He actually stopped uploading GGUF models about a month ago.

Meanwhile, I am trying to get it working with HQQ.

I was able to convert the safetensors to a GGUF model. I'm still working on adding inference support to llama.cpp.
See PR: https://github.com/ggerganov/llama.cpp/pull/6033
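For anyone who wants to reproduce the conversion once that PR lands, the steps are roughly as follows (the local paths are assumptions, and you need a llama.cpp checkout that includes the Command-R support from the PR):

```shell
# clone llama.cpp (must include the Command-R support from the PR above)
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp

# convert the Hugging Face safetensors checkpoint to GGUF
# (assumes the model repo was downloaded to ./c4ai-command-r-v01)
python convert-hf-to-gguf.py ./c4ai-command-r-v01 --outfile command-r.gguf

# optionally quantize the result, e.g. to Q4_K_M
make quantize
./quantize command-r.gguf command-r.Q4_K_M.gguf Q4_K_M
```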

Where is the GGUF format of this model?

Starting to add the GGUF files here: https://huggingface.co/andrewcanis/c4ai-command-r-v01-GGUF

Great work, Andrew! Both on the GGUF models and especially on the PR you made to llama.cpp.
Thank you

Is there a GPTQ or AWQ version?
