Spaces: Running on A10G
Please update the conversion script. Llama.cpp added support for the Nemotron and Minitron architectures.
#111 opened by NikolayKozloff
I tried to make a GGUF for nvidia/Nemotron-4-Minitron-8B-Base using your HF space but got this error:

Error converting to fp16:
INFO:hf-to-gguf:Loading model: Nemotron-4-Minitron-8B-Base
ERROR:hf-to-gguf:Model NemotronForCausalLM is not supported
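For context on what this error means: the converter dispatches on the `architectures` field of the model's config.json and rejects class names it doesn't recognize. The sketch below is a minimal illustration of that pattern, not the actual hf-to-gguf code; the `SUPPORTED_ARCHITECTURES` set and `check_architecture` helper are hypothetical and the set is not exhaustive.

```python
import json

# Illustrative only: a registry of architecture class names the converter
# knows how to handle. The real script maintains its own registry.
SUPPORTED_ARCHITECTURES = {
    "LlamaForCausalLM",
    "GemmaForCausalLM",
    "NemotronForCausalLM",  # present only in versions that include Nemotron support
}

def check_architecture(config_json: str) -> str:
    """Return the model's declared architecture, or raise an error
    similar to the one reported above when it is unsupported."""
    arch = json.loads(config_json)["architectures"][0]
    if arch not in SUPPORTED_ARCHITECTURES:
        raise ValueError(f"Model {arch} is not supported")
    return arch

# Example: a config.json snippet declaring the Nemotron architecture.
print(check_architecture('{"architectures": ["NemotronForCausalLM"]}'))
```

If the Space is running an older copy of the script whose registry predates Nemotron support, the lookup fails exactly as in the error above, which is why restarting on the latest version matters.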
The space has been restarted and should be running the latest version now.
ngxson changed discussion status to closed
It still doesn't work; I got the same error message again.
reach-vb changed discussion status to open
Hi @NikolayKozloff - there are known issues with the 8B: https://github.com/ggerganov/llama.cpp/pull/8922#issuecomment-2294891881. This will require a fix in transformers, which should land soon.