GGUF Model Long Context Infinite Thinking

#3
by xldistance - opened

Both the GGUF I converted and other community-converted versions of Step Flash 3.7 GGUF suffer from an infinite thinking issue with long contexts — not repetitive responses, but endless reasoning that never concludes. Only the officially released GGUF works correctly.

The performance of the GGUF model has also degraded significantly. For example, when my input contains the word "strategy", the model incorrectly claims there is a spelling error and treats it as "strateg".

Unsloth AI org

That's weird - will check. We did KLD of StepFun's official vs AesSedai and ours:

image

So on KLD, it looks fine

It's unclear whether this is an issue with the model itself.

xldistance changed discussion status to closed

Also see https://huggingface.co/unsloth/Step-3.7-Flash-GGUF/discussions/1#6a1a84c7ac67b51dd73e9ffe

The 'strategies' in the code was recognized as 'strategٹ' — probably a model issue.
QQ20260530-143423

Unsloth AI org

@xldistance wait is this super helpful - let me check!

The official IQ4_XS has this issue too......

Sign up or log in to comment