Qwen-14B-Chat-Int4 / modeling_qwen.py

Commit History

update modeling_qwen.py
5d28542

yangapku commited on

update batch inference
4b4dcdc

yangapku commited on

softmax_in_fp32
6c6ec1c

yangapku commited on

update modeling_qwen.py
5d52159

yangapku commited on

update kernels
b980709

yangapku commited on

update modeling_qwen.py
45eb93c

yangapku commited on

update modeling_qwen.py
0f5e18f

yangapku commited on

update kvcache
a828abf

yangapku commited on

update readme
f47dcd2

yangapku commited on

update batch infer
d83208a

yangapku commited on

upload model
ac4ce9b

yangapku commited on