https://huggingface.co/vicgalle/Configurable-Llama-3-8B-v0.3

#30

by deleted - opened Apr 20

deleted

Apr 20

Thanks for doing v0.2. It had an end token issue that causes GPT4All to freeze and koboldcpp to continue for a while after the response. The author said v0.3 might fix this issue.

mradermacher

Owner Apr 20

The freeze and koboldcpp issues are bugs in those programs, for koboldcpp there is a fixed version available, but the correct configuration must be used. the "fix" that is usually applied breaks the tokenizer, even though it superficially works.

not that i mind quanting v.03, of course :) it's in the queue and should be available in a few hours. cheers!

mradermacher changed discussion status to closed Apr 20

deleted

Apr 20

@mradermacher Thanks for quantizing v0.3, and I respect your decision to stick to the standard. It's been over a month since GPT4All has been updated, which is about a year in LLM time, but a new release appears imminent.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment