stan-hua
AWQ model for meta-llama/Meta-Llama-3-8B-Instruct: {'w_bit': 4, 'zero_point': True, 'q_group_size': 128, 'version': 'GEMM'}
262b0f2 verified
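
For readers unfamiliar with the settings in the commit message, here is a minimal sketch of what `w_bit`, `zero_point`, and `q_group_size` plausibly control. It assumes the dictionary follows the usual AutoAWQ-style convention. It is not the code used to produce this checkpoint: it shows only plain min/max group-wise quantization with a zero point and leaves out AWQ's activation-aware scaling step entirely. The `version` key (kernel/packing format) is not modelled. The function names `quantize_group`, `dequantize_group`, and `quantize_row` are hypothetical.

```python
# Illustrative sketch only: group-wise quantization with a zero point.
# Assumption: the config keys carry their usual AutoAWQ-style meaning.
# AWQ's activation-aware scale search is NOT implemented here.
QUANT_CONFIG = {"w_bit": 4, "zero_point": True, "q_group_size": 128, "version": "GEMM"}


def quantize_group(weights, w_bit):
    """Map one group of floats to integers in [0, 2**w_bit - 1]."""
    q_max = 2 ** w_bit - 1
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / q_max
    if scale == 0.0:
        scale = 1.0  # constant group: any non-zero scale works
    # The zero point is the integer code that represents the real value 0.
    zero = min(max(round(-w_min / scale), 0), q_max)
    q = [min(max(round(w / scale) + zero, 0), q_max) for w in weights]
    return q, scale, zero


def dequantize_group(q, scale, zero):
    """Recover approximate floats from integer codes."""
    return [(code - zero) * scale for code in q]


def quantize_row(weights, config):
    """Split a row of weights into groups and quantize each independently."""
    size = config["q_group_size"]
    groups = []
    for start in range(0, len(weights), size):
        groups.append(quantize_group(weights[start:start + size], config["w_bit"]))
    return groups


if __name__ == "__main__":
    row = [i / 256 - 0.5 for i in range(256)]  # toy weights, two groups of 128
    for q, scale, zero in quantize_row(row, QUANT_CONFIG):
        print(f"scale={scale:.5f} zero={zero} codes={min(q)}..{max(q)}")
```

Each group of 128 weights gets its own scale and zero point, so every weight is stored as a 4-bit code plus a small amount of per-group metadata.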