Qwen-14B-Chat-Int8 / modeling_qwen.py

Commit History

update modeling_qwen.py
1b03dff

yangapku committed on

update modeling_qwen.py
002013d

yangapku committed on

update modeling_qwen.py
ec98e72

yangapku committed on

update
953a802

yangapku committed on

remove fix-sized causal mask
166eb67

yangapku committed on

add kernel file check in modeling_qwen.py
5ef07ce

yangapku committed on

update modeling.py
1a3186c

yangapku committed on

update modeling_qwen.py
6ca42df

yangapku committed on

update batch inference
7f8da63

yangapku committed on

upload model
9a3c8fc

yangapku committed on