Gibberish output

by Adriato - opened Jun 17

Jun 17

I only get gibberish output in Oobabooga Text-gen from all Qwen-2 models, and from no other models. Am I missing something?

Owner Jun 17

If you're offloading to CUDA, make sure to enable flash attention.

Jun 18

Still gibberish with flash attention. Enabling no_offload_kqv did the trick, though. I'm using the Q6_K quant. Thanks.
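
For anyone loading the model from a script rather than the web UI, here is a rough sketch of the same two settings. It assumes the llama-cpp-python bindings, where (as far as I can tell) the UI's no_offload_kqv checkbox corresponds to passing `offload_kqv=False`. The helper names and the model file name are made up for illustration, and whether you need one setting or both may depend on your build and GPU.

```python
def qwen2_loader_kwargs(model_path, flash_attn=True, offload_kqv=False):
    """Build keyword arguments for llama_cpp.Llama (hypothetical helper).

    flash_attn=True   -> the "enable flash attention" suggestion above
    offload_kqv=False -> assumed equivalent of ticking no_offload_kqv
    """
    return {
        "model_path": model_path,
        "n_gpu_layers": -1,  # offload all layers to the GPU
        "flash_attn": flash_attn,
        "offload_kqv": offload_kqv,
    }


def load_model(model_path):
    # Imported lazily so the helper above works without llama-cpp-python installed.
    from llama_cpp import Llama
    return Llama(**qwen2_loader_kwargs(model_path))


if __name__ == "__main__":
    # Hypothetical file name; substitute the path to your own GGUF file.
    print(qwen2_loader_kwargs("qwen2-7b-instruct-q6_k.gguf"))
```

If the output is still garbled, toggling the two flags one at a time should show which one matters on a given setup.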
