TheBloke/Llama-2-70B-Chat-GGUF does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack.
#6 by senthilknzanz - opened
llama_model = pipeline("text-generation", model="TheBloke/Llama-2-70B-Chat-GGUF", token=access_token)
For the code above, I get the following error:
OSError: TheBloke/Llama-2-70B-Chat-GGUF does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack.
I'm facing the same error when using pipeline.
@swathiKonakanchi
This does not work with the Hugging Face transformers pipeline, since it does not support GGUF-quantized models.
Use llama-cpp-python or ctransformers instead; both support loading GGUF models.
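A minimal sketch of the llama-cpp-python route, assuming you have already downloaded one of the GGUF files from the repo (the filename below is an assumption; check the repo's file list for the exact quantization you want):

```python
import os

# Assumed local filename, downloaded from TheBloke/Llama-2-70B-Chat-GGUF;
# pick the quantization (Q4_K_M, Q5_K_M, ...) that fits your hardware.
MODEL_PATH = "llama-2-70b-chat.Q4_K_M.gguf"


def build_prompt(user_message: str) -> str:
    # Llama-2 chat prompt template (single turn, no system prompt)
    return f"[INST] {user_message} [/INST]"


if os.path.exists(MODEL_PATH):
    # Import here so the sketch is inspectable even without the library;
    # install with: pip install llama-cpp-python
    from llama_cpp import Llama

    llm = Llama(model_path=MODEL_PATH, n_ctx=2048)
    out = llm(build_prompt("What is the capital of France?"), max_tokens=32)
    print(out["choices"][0]["text"])
```

The key difference from the transformers pipeline is that llama-cpp-python loads the GGUF file directly from disk rather than looking for pytorch_model.bin or similar checkpoint formats on the Hub.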