Introducing AutoRound int4 algoirhtm

#14
by wenhuach - opened

Hello, first and foremost, I would like to express my gratitude for your exceptional work and for sharing your model with the community. We have recently applied AutoRound to your model, achieving good results . Below are the accuracies, all tested with real quantized models in the same environment , batch_size 16 and zero shot tasks.

Metric BF16 INT4
Avg. 0.4504 0.4470
mmlu 0.5096 0.5053
cmmlu 0.5486 0.5426
ceval 0.5394 0.5223
gsm8k 0.2039 0.2176

Unfortunately, we are unable to upload the quantized model due to licensing constraints. Therefore, we would appreciate it if you could generate it yourself by following the recipe links, and we are here to provide assistance.

Sign up or log in to comment