	---
	license: apache-2.0
	---
	# GreenBit LLMs

	These are GreenBitAI's pretrained low-bit LLMs, which combine extreme compression with strong performance.

	Please refer to our [GitHub page](https://github.com/GreenBitAI/green-bit-llm) for the code to run the models and for more information.

	### Zero-shot Evaluation

	We evaluate the zero-shot ability of low-bit quantized Qwen1.5 models using the `llm_eval` library and list the results below:

	| Repository (Qwen Family) | Avg Acc. | OpenBQ | ARC-E | Winogr. | HellaS. | ARC-C | PIQA | BoolQ | RACE | ANLI-R1 | ANLI-R2 | ANLI-R3 | WiC |
	|:----------------------------------|:------------:|:------------:|:-----------:|:-------------:|:-------------:|:-----------:|:----------:|:-----------:|:-----------:|:-------------:|:-------------:|:-------------:|:---------:|
	| `Qwen-1.5-0.5B-layer-mix-bpw-2.2` | 0.398 | 0.170 | 0.443 | 0.527 | 0.332 | 0.238 | 0.634 | 0.620 | 0.318 | 0.332 | 0.338 | 0.330 | 0.500 |
	| `Qwen-1.5-0.5B-layer-mix-bpw-2.5` | 0.394 | 0.170 | 0.514 | 0.541 | 0.337 | 0.232 | 0.637 | 0.496 | 0.318 | 0.316 | 0.358 | 0.326 | 0.490 |
	| `Qwen-1.5-0.5B-layer-mix-bpw-3.0` | 0.407 | 0.198 | 0.533 | 0.536 | 0.348 | 0.234 | 0.671 | 0.552 | 0.323 | 0.330 | 0.333 | 0.335 | 0.495 |
	| `Qwen-1.5-1.8B-layer-mix-bpw-2.2` | 0.415 | 0.218 | 0.539 | 0.586 | 0.392 | 0.260 | 0.678 | 0.622 | 0.333 | 0.333 | 0.333 | 0.336 | 0.464 |
	| `Qwen-1.5-1.8B-layer-mix-bpw-2.5` | 0.423 | 0.222 | 0.592 | 0.585 | 0.406 | 0.267 | 0.695 | 0.629 | 0.336 | 0.314 | 0.339 | 0.361 | 0.507 |
	| `Qwen-1.5-1.8B-layer-mix-bpw-3.0` | 0.438 | 0.246 | 0.576 | 0.563 | 0.413 | 0.277 | 0.694 | 0.645 | 0.352 | 0.323 | 0.336 | 0.343 | 0.492 |
	| `Qwen-1.5-4B-layer-mix-bpw-2.2` | 0.480 | 0.254 | 0.663 | 0.623 | 0.463 | 0.339 | 0.712 | 0.718 | 0.349 | 0.326 | 0.355 | 0.384 | 0.513 |
	| `Qwen-1.5-4B-layer-mix-bpw-2.5` | 0.490 | 0.266 | 0.677 | 0.629 | 0.473 | 0.365 | 0.732 | 0.717 | 0.351 | 0.372 | 0.352 | 0.360 | 0.502 |
	| `Qwen-1.5-4B-layer-mix-bpw-3.0` | 0.502 | 0.268 | 0.678 | 0.642 | 0.494 | 0.358 | 0.755 | 0.757 | 0.380 | 0.395 | 0.395 | 0.392 | 0.519 |
	| `Qwen-1.5-7B-layer-mix-bpw-2.2` | 0.513 | 0.278 | 0.669 | 0.654 | 0.504 | 0.389 | 0.741 | 0.759 | 0.376 | 0.383 | 0.410 | 0.403 | 0.517 |
	| `Qwen-1.5-7B-layer-mix-bpw-2.5` | 0.520 | 0.294 | 0.705 | 0.650 | 0.520 | 0.387 | 0.750 | 0.769 | 0.371 | 0.445 | 0.424 | 0.398 | 0.564 |
	| `Qwen-1.5-7B-layer-mix-bpw-3.0` | 0.531 | 0.292 | 0.713 | 0.654 | 0.545 | 0.405 | 0.764 | 0.807 | 0.383 | 0.424 | 0.393 | 0.414 | 0.627 |
	| `Qwen-1.5-14B-layer-mix-bpw-2.5` | 0.553 | 0.318 | 0.727 | 0.682 | 0.564 | 0.413 | 0.775 | 0.792 | 0.390 | 0.472 | 0.434 | 0.446 | 0.623 |
	| `Qwen-1.5-32B-layer-mix-bpw-3.0` | 0.599 | 0.346 | 0.775 | 0.722 | 0.620 | 0.492 | 0.807 | 0.853 | 0.444 | 0.515 | 0.494 | 0.478 | 0.642 |
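
The `Avg Acc.` column appears to be the unweighted mean of the twelve per-task scores in each row. The README does not state this explicitly, so treat it as an assumption. The sketch below checks it for the smallest and largest models, using scores copied from the table. The names `task_scores`, `reported_avg`, and `average_accuracy` are illustrative and not part of any GreenBitAI code.

```python
from statistics import mean

# Per-task scores copied from two rows of the table, in column order:
# OpenBQ, ARC-E, Winogr., HellaS., ARC-C, PIQA, BoolQ, RACE,
# ANLI-R1, ANLI-R2, ANLI-R3, WiC.
task_scores = {
    "Qwen-1.5-0.5B-layer-mix-bpw-2.2": [
        0.170, 0.443, 0.527, 0.332, 0.238, 0.634,
        0.620, 0.318, 0.332, 0.338, 0.330, 0.500,
    ],
    "Qwen-1.5-32B-layer-mix-bpw-3.0": [
        0.346, 0.775, 0.722, 0.620, 0.492, 0.807,
        0.853, 0.444, 0.515, 0.494, 0.478, 0.642,
    ],
}

# "Avg Acc." values as reported in the table for the same rows.
reported_avg = {
    "Qwen-1.5-0.5B-layer-mix-bpw-2.2": 0.398,
    "Qwen-1.5-32B-layer-mix-bpw-3.0": 0.599,
}


def average_accuracy(scores):
    """Unweighted mean over tasks (assumed definition of Avg Acc.)."""
    return mean(scores)


for name, scores in task_scores.items():
    computed = average_accuracy(scores)
    # The table lists three decimals, so allow a 0.001 tolerance.
    matches = abs(computed - reported_avg[name]) < 1e-3
    print(f"{name}: computed={computed:.4f} "
          f"reported={reported_avg[name]:.3f} match={matches}")
```

Both rows agree with the reported averages to within the table's three-decimal precision, which is consistent with a plain mean in which every task carries equal weight regardless of its test-set size.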
