It generates <0x0A> instead of a new line

by Hoioi - opened Dec 10, 2023

Dec 10, 2023

I'm not sure if there's something wrong with the original model or the quantized version, but it generates <0x0A> instead of going to a new line.

TheBloke

Owner Dec 10, 2023

OK, thanks. I'm also not sure whether this comes from the original model or from a bug in the code I need to use to make the GGUF.

Because the original model doesn't include a tokenizer.model, I need to use a llama.cpp PR that can make GGUF from tokenizer.json, and that PR currently seems to have this bug.

I will see if I can find a tokenizer.model for this model and do it that way instead.
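
For context, `<0x0A>` is the byte-fallback token for byte 0x0A (line feed) in SentencePiece-style Llama vocabularies, so seeing it as literal text suggests the byte tokens were not converted back into bytes. Until fixed files are available, one possible stopgap is to post-process the generated text. The sketch below is only an illustration of that idea, not the fix applied to the GGUF files; the function name `decode_byte_tokens` is hypothetical, and it assumes the output contains the tokens literally in the `<0xNN>` form.

```python
import re

# Matches one or more consecutive byte-fallback tokens, e.g. <0x0A> or
# <0xE2><0x82><0xAC>. Runs are decoded together so that multi-byte UTF-8
# characters are reassembled correctly.
_BYTE_RUN = re.compile(r"(?:<0x[0-9A-Fa-f]{2}>)+")
_BYTE_HEX = re.compile(r"<0x([0-9A-Fa-f]{2})>")


def decode_byte_tokens(text):
    """Replace literal <0xNN> sequences with the characters they encode.

    Hypothetical helper: a text-level workaround, assuming the model output
    contains byte-fallback tokens as plain text.
    """
    def _decode(match):
        hex_codes = _BYTE_HEX.findall(match.group(0))
        raw = bytes(int(code, 16) for code in hex_codes)
        # Invalid UTF-8 sequences become U+FFFD instead of raising.
        return raw.decode("utf-8", errors="replace")

    return _BYTE_RUN.sub(_decode, text)


if __name__ == "__main__":
    print(repr(decode_byte_tokens("Hello<0x0A>world")))  # 'Hello\nworld'
```

This only hides the symptom in the output text; it can also rewrite a `<0xNN>` string that the model meant to print literally, so a correctly converted GGUF is still the proper fix.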

TheBloke

Owner Dec 10, 2023

edited Dec 10, 2023

I've re-created the GGUFs for this model and v1 and newlines are now generated correctly.

lixbo

Dec 10, 2023

@TheBloke hey man, please don't forget to quantize the neural-chat-7b-v3-3 model.

Hoioi

Dec 10, 2023

Thank you so much for your fast support!
