Consider adding <start_of_context> and <stop_of_context> or similar special tokens for context ingestion.

#13

by qnixsynapse - opened Apr 10, 2024

Apr 10, 2024

Gemma 1.1 is a serious upgrade over the earlier version and I was able to made it work with documents which required some amount of tweaking on the prompt format:

Adding special tokens for context as part of the prompt format will allow the model to differentiate between the context and the query which currently confuses the model a bit.

IMO, tiny models can benefit from it and it will also reduce hallucination.

This is my suggestion. :)

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment