https://huggingface.co/jeiku/Personal_4B
Looking mainly for the imatrix Q4_0_4_8. This is probably the pinnacle for 4B and was produced by a three-step training process. Thanks in advance!
It's queued. The Q4_0_4_8 quant will be generated as part of the imatrix ones (hopefully :)
I just realized I may not have fixed the configs on this one yet, but I'd be excited to see if it works without adjusting them (it's an axolotl thing).
Let me know if you have any trouble and I'll fix it asap.
Well, it was already delayed because my scheduler was surprised by a 4B taking 82GB of disk space :)
Yeah, I didn't clean the repo at all... Sorry, I just had a hankering to chat with this one, which I made a few months back. If you give me a few minutes I can clean it up.
https://huggingface.co/jeiku/Personal_4B/tree/main
Okay, deleted all the junk and made the config changes that worked during the initial testing phase. Sorry about that. This should give you no issues.
All works fine, it seems. And it's totally fine to have checkpoints in the repo etc.; it's just that the disk space budget was suddenly negative, but I caught it in time.
Also, the config changes are not reflected in the GGUF because the download came first. Should I redo it?
If the GGUF works, then it works, but I was not able to run inference on the GGUFs created from the old configs for several sister models of this one.
It appears that axolotl somehow alters the parameters; I've verified with several colleagues that this occurs.
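If anyone wants to pin down exactly what changed, a quick way is to diff the saved config.json against the base model's. Just a rough sketch (the base model id below is a placeholder, since it isn't named in this thread):

```python
# Rough sketch: diff the trained repo's config.json against its base model's
# to see which parameters differ. "base-org/base-4b-model" is a placeholder --
# substitute the real base model id.
import json
from huggingface_hub import hf_hub_download

def load_config(repo_id: str) -> dict:
    path = hf_hub_download(repo_id, "config.json")
    with open(path) as f:
        return json.load(f)

trained = load_config("jeiku/Personal_4B")
base = load_config("base-org/base-4b-model")  # placeholder base model id

# Print every key whose value differs or exists in only one of the two configs.
for key in sorted(set(trained) | set(base)):
    if trained.get(key) != base.get(key):
        print(f"{key}: base={base.get(key)!r} -> trained={trained.get(key)!r}")
```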
Well, a common issue is that transformers has a multitude of tokenizers. Even when using only the built-in one, transformers might only use the fast tokenizer, while llama.cpp might want to read the old format, and so on.
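For what it's worth, here's a rough way to check which tokenizer formats a repo ships and whether the fast and slow tokenizers agree (just a sketch; assumes transformers and huggingface_hub are installed and uses the repo linked above):

```python
# Rough sketch: list the tokenizer-related files a repo ships and compare the
# fast and slow tokenizers, since llama.cpp's converter may want the old
# sentencepiece format while transformers is happy with tokenizer.json alone.
from huggingface_hub import list_repo_files
from transformers import AutoTokenizer

repo = "jeiku/Personal_4B"

files = list_repo_files(repo)
print("tokenizer-related files:", [f for f in files if "token" in f.lower()])

fast = AutoTokenizer.from_pretrained(repo, use_fast=True)
try:
    slow = AutoTokenizer.from_pretrained(repo, use_fast=False)
    sample = "Hello, world!"
    print("fast == slow:", fast(sample)["input_ids"] == slow(sample)["input_ids"])
except Exception as e:
    # Only the fast (tokenizer.json) format is available in the repo.
    print("slow tokenizer unavailable:", e)
```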