I don't suppose you could upload the REAP weight before you quantised them

#1
by infinityai - opened

Thank you for making this REAP

I don't suppose you could upload the REAPd 16bit weights before you quantised them, So that we can make more alternative quantisations

Or could you try and quantise them Using this new quantisation method that this person has developed https://github.com/jjang-ai/jangq

https://jangq.ai

From what I can see he is able to quantise them even more possibly reducing it by another half in size while still keeping whilst still keeping the Reasoning capabilities

Thanks

Sign up or log in to comment