
GGUF Please

#1
by HR1777 - opened

@TheBloke Please make the GGUF version of this model

llama.cpp support for Mamba is coming soon, see https://github.com/ggerganov/llama.cpp/pull/5328

Converting requires at least adding `"architectures": ["MambaForCausalLM"],` to config.json, though.
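For anyone who wants to patch the file themselves while waiting, here is a minimal sketch of that edit. The checkpoint path is a placeholder; it assumes the model files are already downloaded locally:

```python
import json
from pathlib import Path

# Placeholder path to the locally downloaded checkpoint directory.
config_path = Path("path/to/model/config.json")

config = json.loads(config_path.read_text())

# Add the architectures entry the llama.cpp converter looks for,
# unless the config already declares one.
config.setdefault("architectures", ["MambaForCausalLM"])

config_path.write_text(json.dumps(config, indent=2) + "\n")
```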

The llama.cpp Mamba support PR got merged.

@jondurbin Please add the missing `"architectures": ["MambaForCausalLM"],` line to config.json, so that the model can be quantized with llama.cpp without any further manipulation.
