weighted/imatrix quants of https://huggingface.co/RLHFlow/LLaMA3-iterative-DPO-final