Qwen
/

Qwen-14B-Chat-Int8

Text Generation

8-bit precision

Model card Files Files and versions Community

Qwen-14B-Chat-Int8 / modeling_qwen.py

Commit History

update modeling_qwen.py

1b03dff

yangapku commited on Dec 6, 2023

update modeling_qwen.py

002013d

yangapku commited on Dec 4, 2023

update modeling_qwen.py

ec98e72

yangapku commited on Dec 3, 2023

update

953a802

yangapku commited on Nov 30, 2023

remove fix-sized causal mask

166eb67

yangapku commited on Nov 14, 2023

add kernel file check in modeling_qwen.py

5ef07ce

yangapku commited on Nov 5, 2023

update modeling.py

1a3186c

yangapku commited on Oct 26, 2023

update modeling_qwen.py

6ca42df

yangapku commited on Oct 16, 2023

update batch inference

7f8da63

yangapku commited on Oct 14, 2023

upload model

9a3c8fc

yangapku commited on Oct 12, 2023