Smaller q8 version

#2
by bigstorm - opened

Howdy,

Great model. I'd love to experiment deeper, any chance you could release a slightly smaller q8 version of the model? I have 3x 3090s and I'm just a hair to short on the VRAM to load it all up.

Thanks for working on this project.

Sign up or log in to comment