Llama 3 chat template not working?
#1 by Jdods84 - opened
When I use the Llama 3 chat template for my prompt, Lumimaid seems to repeat responses and lose its mind. When I remove that formatting and use the Alpaca format instead, it RPs and works fine. I am running this LLM locally with KoboldCPP. Would it be possible to get some help from an expert with formatting my prompt correctly?
It works for me on Kobold; I use the one I put on the model card (Llama-3-Instruct).
It also works unquantized; the prompt format is in tokenizer_config.json.
Can you check that and make sure you're using the right one? Have you updated all your tools?
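
For anyone hitting the same issue, a quick way to confirm what the model actually expects is to render the chat template straight from tokenizer_config.json with transformers. This is a minimal sketch; the repo id `NeverSleep/Llama-3-Lumimaid-8B-v0.1` is an assumption, so substitute the exact Lumimaid variant you downloaded.

```python
from transformers import AutoTokenizer

# Repo id is an assumption -- replace with the exact model you are running.
tok = AutoTokenizer.from_pretrained("NeverSleep/Llama-3-Lumimaid-8B-v0.1")

messages = [
    {"role": "system", "content": "You are a roleplay assistant."},
    {"role": "user", "content": "Hello!"},
]

# Render the prompt exactly as the chat_template in tokenizer_config.json
# defines it, with the assistant header appended so the model continues.
prompt = tok.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
print(prompt)
```

For a Llama-3-Instruct model the printed prompt should wrap each turn in `<|start_header_id|>role<|end_header_id|>` headers and close it with `<|eot_id|>`. If the prompt KoboldCPP is sending looks different from this output, the template settings are the likely cause of the repetition.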