prompt format and text-gen

#1
by gandolfi - opened

Hello,
how can I configure this prompt format in text-generation-webui?

{system_prompt}
Human: {prompt}
Assistant: <|EOT|>
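For reference, older text-generation-webui releases defined instruction templates as YAML files under `instruction-templates/`. A hedged sketch of what that file might look like for the format above (the `user`/`bot`/`turn_template`/`context` keys follow the legacy template schema; newer releases use a Jinja2 chat template pasted into the Instruction template box instead):

```yaml
# Hypothetical legacy instruction template for this prompt format.
# <|user-message|> / <|bot-message|> are the placeholders the legacy
# schema substitutes with the actual turn contents.
user: "Human:"
bot: "Assistant:"
context: ""   # put the literal system prompt text here, if any
turn_template: "<|user|> <|user-message|>\n<|bot|> <|bot-message|><|EOT|>\n"
```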

thanks

It should work, yes; it was extracted from the model's chat template.

Try instruct instead of chat-instruct. I personally dislike chat-instruct because it wraps everything in "the following is a chat between a user and a bot" or something similar.

Your rope frequency also looks off; try setting it to 4, for alpha_value I think (it may be the compress one). I've found that even when not pushing context, some models NEED the rope frequency set or they become incoherent.
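The two knobs mentioned above do different things. A minimal sketch of the distinction, assuming the usual NTK-aware formula for alpha (base multiplied by `alpha^(d/(d-2))`) and plain linear position interpolation for the compress option; the function names here are illustrative, not TGW API:

```python
def rope_base_ntk(alpha: float, base: float = 10000.0, head_dim: int = 128) -> float:
    """NTK-aware scaling (alpha_value style): raise the rotary base frequency
    so high-dimension rotations slow down and longer contexts stay coherent."""
    return base * alpha ** (head_dim / (head_dim - 2))

def scaled_position(pos: int, compress: float) -> float:
    """Linear interpolation (compress_pos_emb style): shrink position indices
    so a long context maps back into the trained position range."""
    return pos / compress

# alpha leaves positions alone but changes the base; compress does the opposite.
# e.g. scaled_position(8192, 4) maps position 8192 back to 2048.
```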

I have tried with instruct only and alpha_value set to 4, but it's still incoherent. I will try with Ollama.

AutoCoder models are based on DeepSeek-Coder, which needs compress_pos_emb = 4 to extend context from 4k to 16k.
TGW seems to properly read GGUF rope scaling metadata from old DeepSeek-Coder GGUFs (e.g. TheBloke's) but doesn't recognize it in newer ones.
Most likely the GGUF metadata format changed since then and the change wasn't reflected in TGW's code: it only recognizes the old "rope.scale_linear" entry, but newer GGUFs define scaling in two entries that it doesn't check, "rope.scaling.factor" and "rope.scaling.type".
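The fallback logic described above can be sketched as follows. This is a hypothetical helper, not TGW code; note that real GGUF keys are architecture-prefixed (e.g. `llama.rope.scaling.factor`), which is an assumption about how the entries appear in the metadata dict:

```python
from typing import Optional

def rope_linear_factor(metadata: dict, arch: str = "llama") -> Optional[float]:
    """Return the linear rope scaling factor, checking the old single-key
    form first, then the newer two-key (type + factor) form."""
    old = metadata.get(f"{arch}.rope.scale_linear")
    if old is not None:
        return float(old)
    if metadata.get(f"{arch}.rope.scaling.type") == "linear":
        factor = metadata.get(f"{arch}.rope.scaling.factor")
        if factor is not None:
            return float(factor)
    return None
```

Checking both forms would let a loader handle old and new GGUF files alike.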

Imho, since the model was trained with linear scaling, compress_pos_emb will most likely work better here than alpha_value.

Thanks @cgus, I couldn't remember which was correct, and your comment seems likely to be true for the rest as well!

Thanks. It works now with compress_pos_emb = 4 and an update of text-gen.

Awesome glad to hear it!!
