https://huggingface.co/nvidia/Llama-3_1-Nemotron-51B-Instruct
#306 opened by Pomni
i've seen a benchmark of this in the lm studio server, and apparently it's comparable to a 70b model. i would like to try it out (i started using the downstairs living room pc, which has WAY better specs and an AVX2 cpu, unlike my main AVX-only pc)
well, let's see if it is supported by llama.cpp. i am a bit skeptical...
yeah, unfortunately:

```
ERROR:hf-to-gguf:Model DeciLMForCausalLM is not supported
```
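(for context: that error comes from llama.cpp's `convert_hf_to_gguf.py`, which dispatches on the `architectures` field in the model's `config.json`, and `DeciLMForCausalLM` was not a registered architecture at the time. a minimal sketch of checking that field up front, assuming the `huggingface_hub` package, so you can tell whether a conversion can even start before downloading ~100 GB of weights:)

```python
import json

from huggingface_hub import hf_hub_download

# fetch only the small config.json, not the full model weights
config_path = hf_hub_download(
    repo_id="nvidia/Llama-3_1-Nemotron-51B-Instruct",
    filename="config.json",
)

with open(config_path) as f:
    config = json.load(f)

# the converter maps this architecture name to a conversion class;
# an unknown name produces the "Model ... is not supported" error above
print(config["architectures"])  # expected: ['DeciLMForCausalLM']
```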
mradermacher changed discussion status to closed