Could you please create Q6 and Q8 quants as well?
#2 opened by hrishbhdalal
This will help the people who have a 3x34GB setup. Big fan, btw! Hope you guys finetune some models as well; apparently OpenHermes 2.5 beats Llama 3 8B Instruct by quite some margin!
3x34? What devilry is this? 😂
Ahh.. pardon me, I meant 3x24GB (4090s). I hope you guys train a killer model soon though.. OpenHermes 3, maybe? OpenHermes 2.5 is still better than Llama 3 in some benchmarks.
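For anyone else sizing quants against their VRAM, here's a rough back-of-the-envelope sketch. The bits-per-weight figures are approximate values for llama.cpp-style GGUF quants, and the 70B parameter count is just an illustrative assumption (the actual model size and KV-cache overhead will change the numbers):

```python
def quant_size_gb(params_billions: float, bits_per_weight: float) -> float:
    """Rough on-disk/in-VRAM size of a quantized model in GB.

    Ignores KV cache, context buffers, and per-tensor overhead,
    so treat the result as a lower bound.
    """
    return params_billions * bits_per_weight / 8  # GB = Gparams * bpw / 8

# Approximate bits-per-weight (assumed values for common GGUF quants)
Q6_BPW = 6.56  # ~Q6_K
Q8_BPW = 8.50  # ~Q8_0

# Hypothetical 70B model on a 3x24GB = 72GB rig
print(quant_size_gb(70, Q6_BPW))  # ~57.4 GB, fits with room for context
print(quant_size_gb(70, Q8_BPW))  # ~74.4 GB, a tight squeeze at best
```

So on a 3x24GB setup a Q6 of a 70B-class model fits comfortably, while Q8 is borderline before you even account for context.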