
Quantizing this by following your article

#4
by ritvikshandilya

Hey, I was trying to quantize it by following your article https://mlabonne.github.io/blog/posts/Quantize_Llama_2_models_using_ggml.html, but the tokenizer.model file is missing. Can you help us use your quantization tutorial (GGUF) on the Colab fine-tuning files?

This error happens because the model you want to quantize (llama-2-7b-meditext) doesn't have a tokenizer in its repo. You can simply download the tokenizer files from Llama-2 (https://huggingface.co/meta-llama/Llama-2-7b-hf/tree/main) and place them in your model's folder.
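In case it helps, here is a minimal sketch of that step using huggingface_hub. It assumes the fine-tuned model lives in a local folder named `llama-2-7b-meditext` (adjust to your actual path) and that you have been granted access to the gated meta-llama repo and are logged in with a valid token:

```python
from huggingface_hub import hf_hub_download

# Hypothetical local folder containing the fine-tuned model you want to quantize
model_dir = "llama-2-7b-meditext"

# Download the tokenizer files from the base Llama-2 repo into the model folder
# so that the GGUF conversion script can find tokenizer.model next to the weights.
for filename in ["tokenizer.model", "tokenizer_config.json", "special_tokens_map.json"]:
    hf_hub_download(
        repo_id="meta-llama/Llama-2-7b-hf",
        filename=filename,
        local_dir=model_dir,
    )
```

After that, pointing the conversion step from the article at `model_dir` should no longer complain about the missing tokenizer.model.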
