Intel
/

Qwen3-Next-80B-A3B-Instruct-int4-AutoRound

Text Generation

4-bit precision

Model card Files Files and versions

wenhuach commited on Sep 18

Commit

6bae0a3

·

verified ·

1 Parent(s): f5cdcd3

Update README.md

Files changed (1) hide show

README.md +3 -0

README.md CHANGED Viewed

@@ -11,6 +11,9 @@ This model is a int4 model with group_size 128 and symmetric quantization of [Qw
 Please follow the license of the original model.
 ## How To Use
 ### INT4 Inference
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer

 Please follow the license of the original model.
 ## How To Use
+For vllm, this pr is required https://github.com/vllm-project/vllm/pull/24818
 ### INT4 Inference
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer