Hi @mgoin ,How was this model produced? In practice I find that it is the best performing GPTQ quant for llama3-70B.It would be wonderful to replicate the method for llama3.1-70B.Cheers,Adi
· Sign up or log in to comment