Model produces garbage

#2
by WaveCut - opened

I'm using the Q4_1 quant variant, and regardless of the settings I try, I consistently get the same output: an endless stream of @ characters until I stop the generation.

[screenshot: model output consisting of repeated @ characters]

There is something wrong with it. How do I resolve this issue?
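A minimal, frontend-free check looks something like this (a sketch using the llama-cpp-python bindings; the path, prompt, and sampling values are placeholders, not my exact setup):

```python
# Sketch: load the GGUF directly with llama-cpp-python (pip install llama-cpp-python)
# to rule out frontend/sampler settings. Path and prompt are placeholders.
from llama_cpp import Llama

llm = Llama(model_path="./llama3-Q4_1.gguf", n_ctx=2048)

# Near-greedy sampling so the output reflects the model, not the sampler.
out = llm("Write one sentence about the ocean.", max_tokens=64, temperature=0.0)
print(out["choices"][0]["text"])
```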

Crusoe AI org

Unfortunately, there are ongoing issues with llama3 tokenization in GGUF, and I'm regenerating the quants as potential fixes come in. Until then, I would recommend the MLX quants if you're on a Mac, otherwise EXL2.
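If it helps, running an MLX quant is only a few lines with the mlx-lm package (a rough sketch; the repo id below is a placeholder, not a specific release):

```python
# Sketch: run an MLX quant with mlx-lm (pip install mlx-lm; Apple Silicon only).
# The repo id is a placeholder, not a specific release.
from mlx_lm import load, generate

model, tokenizer = load("your-org/your-llama3-4bit-mlx")
text = generate(model, tokenizer, prompt="Write one sentence about the ocean.",
                max_tokens=64, verbose=True)
print(text)
```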
