Thank you very much!

#1 opened by ajibawa-2023

Hello, thank you for all the GGML & GPTQ models of Carl & Scarlett. I highly appreciate your work & help. I will post more models very soon. Thanks!

You're very welcome! Thank you for the great new models.

I was going to message you actually, I wanted to let you know a couple of things that would make it easier for people to download your models:

  1. Firstly, your models are in float32, which makes them twice as large to download as a 16-bit model.
  2. Secondly, you output a single pytorch_model.bin, which I imagine is why you then split it up using split - because of the 50GB file limit? That requires people downloading to do the extra cat pytorch_model.bin-a* > pytorch_model.bin step, and it also means the model can never be loaded automatically from Transformers.
  • Also, it's usually much quicker to load a model from multiple shards than from one single pytorch_model.bin file, and it takes less RAM to do so - see the loading sketch just below.
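For anyone following along, this is roughly what a properly sharded 16-bit upload enables - a minimal sketch, with the model id as a placeholder:

 # Loading a sharded 16-bit checkpoint straight from the Hub with
 # Transformers - no manual cat reconstruction step needed.
 # "your-name/your-model" is a placeholder, not a real repo.
 import torch
 from transformers import AutoModelForCausalLM, AutoTokenizer

 model_id = "your-name/your-model"

 model = AutoModelForCausalLM.from_pretrained(
     model_id,
     torch_dtype=torch.float16,  # keep the weights in 16-bit
     low_cpu_mem_usage=True,     # stream shards in one at a time to save RAM
 )
 tokenizer = AutoTokenizer.from_pretrained(model_id)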

Fortunately there's an easy way to fix both problems in one. Here's a little script I wrote: https://gist.github.com/TheBloke/8934a51c5572b500c5217f42bfd055a8#file-reshard-py

Run it like this:

 python3 reshard.py --base_model_name_or_path <path to your model> --output_dir <path to output to> --device cpu --max_shard_size '8GiB' --dtype bfloat16

And it will create a model in whatever dtype you pass - here bfloat16, which matches the format your models were in (pass --dtype float16 if you prefer float16) - split into chunks like this:

 [venv] tomj@2b8eac64e03a:/workspace/process/carl-33b ᐅ l source
total 61G
drwxrwxrwx 2 tomj tomj 2.9M Aug 16 14:36 .
drwxrwxrwx 5 tomj tomj 3.0M Aug 16 15:44 ..
-rw-rw-rw- 1 tomj tomj 2.3K Aug 16 13:16 .gitattributes
-rw-rw-rw- 1 tomj tomj 1.7K Aug 16 13:16 README.md
-rw-rw-rw- 1 tomj tomj  690 Aug 16 14:37 config.json
-rw-rw-rw- 1 tomj tomj  137 Aug 16 13:52 generation_config.json
-rw-rw-rw- 1 tomj tomj 8.0G Aug 16 13:52 pytorch_model-00001-of-00008.bin
-rw-rw-rw- 1 tomj tomj 8.0G Aug 16 13:53 pytorch_model-00002-of-00008.bin
-rw-rw-rw- 1 tomj tomj 8.0G Aug 16 13:53 pytorch_model-00003-of-00008.bin
-rw-rw-rw- 1 tomj tomj 8.0G Aug 16 13:53 pytorch_model-00004-of-00008.bin
-rw-rw-rw- 1 tomj tomj 8.0G Aug 16 13:53 pytorch_model-00005-of-00008.bin
-rw-rw-rw- 1 tomj tomj 8.0G Aug 16 13:54 pytorch_model-00006-of-00008.bin
-rw-rw-rw- 1 tomj tomj 8.0G Aug 16 13:54 pytorch_model-00007-of-00008.bin
-rw-rw-rw- 1 tomj tomj 4.9G Aug 16 13:54 pytorch_model-00008-of-00008.bin
-rw-rw-rw- 1 tomj tomj  44K Aug 16 13:54 pytorch_model.bin.index.json
-rw-rw-rw- 1 tomj tomj  435 Aug 16 13:54 special_tokens_map.json
-rw-rw-rw- 1 tomj tomj 1.8M Aug 16 13:54 tokenizer.json
-rw-rw-rw- 1 tomj tomj 489K Aug 16 13:54 tokenizer.model
-rw-rw-rw- 1 tomj tomj  745 Aug 16 13:54 tokenizer_config.json

Then you can upload that and won't have any 50GB problems, and people can download it and use it immediately, including automatically from Transformers code. And it'll be 16-bit rather than float32, so it only takes half as long to upload and download.
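In case the gist link ever goes stale: the heart of a script like that boils down to roughly the following - a sketch built on standard Transformers APIs, not the gist's exact code:

 # Sketch of a resharding script: load the checkpoint in the target
 # dtype, then re-save it as sharded files. Not the actual gist code.
 import argparse
 import torch
 from transformers import AutoModelForCausalLM, AutoTokenizer

 parser = argparse.ArgumentParser()
 parser.add_argument("--base_model_name_or_path", required=True)
 parser.add_argument("--output_dir", required=True)
 parser.add_argument("--device", default="cpu")
 parser.add_argument("--max_shard_size", default="8GiB")
 parser.add_argument("--dtype", default="float16", choices=["float16", "bfloat16"])
 args = parser.parse_args()

 dtype = torch.bfloat16 if args.dtype == "bfloat16" else torch.float16

 # Load the full-precision checkpoint, casting down to 16-bit on the way in
 model = AutoModelForCausalLM.from_pretrained(
     args.base_model_name_or_path, torch_dtype=dtype, low_cpu_mem_usage=True
 ).to(args.device)

 # save_pretrained does the shard splitting and writes
 # pytorch_model.bin.index.json automatically
 model.save_pretrained(args.output_dir, max_shard_size=args.max_shard_size)

 # Save the tokenizer alongside so the output dir is self-contained
 tokenizer = AutoTokenizer.from_pretrained(args.base_model_name_or_path)
 tokenizer.save_pretrained(args.output_dir)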

I still have the sharded files for Carl-33B and Scarlett-33B, so if you like I could PR the bf16 sharded version to your repos. Let me know if that'd be helpful.

Thanks again for the great models and looking forward to seeing more! Feel free to ping me when they're up and I'll quantise them.

Thank you very much for making my & other users' lives easier. I will use the script for sure!
You surely can PR the bf16 sharded version to my repos. That will be great!
Thank you!

Hello Bloke, sorry to bother you. Heartiest congratulations on winning the a16z grant! You guys deserve it!

Hello Bloke,
Can you do the GPTQ & GGML for the following models:

  1. https://huggingface.co/ajibawa-2023/Uncensored-Frank-7B
  2. https://huggingface.co/ajibawa-2023/Uncensored-Frank-13B
  3. https://huggingface.co/ajibawa-2023/Uncensored-Frank-33B

Looking forward to hearing from you. Thank you very much for sharing the script and necessary instructions.

Hello Bloke, any luck with quantizing the above models? I highly appreciate your work for the open source community. Thank you!

Oh sorry, I didn't see this. I'll add them to the queue and do them shortly, in GPTQ, GGUF and AWQ
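For readers wondering what the GPTQ step involves, here's a rough sketch using the AutoGPTQ library - the one-sentence calibration example is a toy placeholder; real runs use a few hundred samples from a proper dataset:

 # Rough sketch of a GPTQ quantization run with AutoGPTQ.
 # The single calibration sentence is a toy placeholder.
 from transformers import AutoTokenizer
 from auto_gptq import AutoGPTQForCausalLM, BaseQuantizeConfig

 model_id = "ajibawa-2023/Uncensored-Frank-13B"
 out_dir = "Uncensored-Frank-13B-GPTQ"

 quantize_config = BaseQuantizeConfig(
     bits=4,          # 4-bit weights
     group_size=128,  # quantization group size
     desc_act=False,  # act-order off, for faster inference
 )

 tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=True)
 examples = [tokenizer("The quick brown fox jumps over the lazy dog.")]

 model = AutoGPTQForCausalLM.from_pretrained(model_id, quantize_config)
 model.quantize(examples)     # calibrate and quantize the weights
 model.save_quantized(out_dir, use_safetensors=True)
 tokenizer.save_pretrained(out_dir)

GGUF and AWQ have their own toolchains (llama.cpp and AutoAWQ respectively), but the overall flow - load, calibrate, quantize, save - is similar.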

Surely, super thankful to you!

Models are starting to upload now

Thank you very much Bloke!


Hello Bloke,
I hope you are doing great! Can you help me by quantizing my following new models:

  1. Uncensored-Jordan-7B: https://huggingface.co/ajibawa-2023/Uncensored-Jordan-7B
  2. Uncensored-Jordan-13B: https://huggingface.co/ajibawa-2023/Uncensored-Jordan-13B
  3. Uncensored-Jordan-33B: https://huggingface.co/ajibawa-2023/Uncensored-Jordan-33B

Looking forward to hearing from you. Thank you for your relentless efforts. Hats off to you!

Yes of course, glad to. I'll add them to the queue now

By the way, is there a reason you're still using Llama 1 for 7B?

Thanks Bloke! It was trained before the release of Mistral.

These are all done

Thank you very much man! Kudos to you.

Hello Bloke,
Trust you are doing great! Can you help me by quantizing my following new models:

  1. Python-Code-13B: https://huggingface.co/ajibawa-2023/Python-Code-13B
  2. Python-Code-33B: https://huggingface.co/ajibawa-2023/Python-Code-33B

Thanks for your guidance & help. I highly appreciate your dedication to the OSS community. Thank you!

Hello Bloke,
Looking forward to a positive response. Sorry if I am troubling you.

Oh sorry, I missed this. My normal place for model requests is the #model-requests forum in my Discord, so the best way to make sure I see a request quickly is to post it there and ping me.

I will add these to my queue now. GGUFs and AWQs will come soon, GPTQs in a couple of hours

Thank you Bloke! I am not on Discord but will join soon. Thank you.

Hello Bloke,
Thank you very much for quantized models. I am extremely thankful to you.

Hello Bloke,
How are you? Can you quantize my model: https://huggingface.co/ajibawa-2023/SlimOrca-13B
Thank you very much! Happy Holidays to you.
