Qwen-7B-Chat-Int4 / modeling_qwen.py

Commit History

update modeling_qwen.py
1e66ba4

yangapku committed on

update
5ff8f11

yangapku committed on

remove fix-sized causal mask
c02ede5

yangapku committed on

add kernel file check in modeling_qwen.py
5bfdae9

yangapku committed on

update modeling.py
8750247

yangapku committed on

update modeling_qwen.py
6ec2d41

yangapku committed on

update batch inference
246a75e

yangapku committed on

softmax_in_fp32
682f4da

yangapku committed on

update modeling_qwen.py
f6d1017

yangapku committed on

update kernels
1581be8

yangapku committed on

update modeling_qwen.py
fcc99d6

yangapku committed on

update modeling_qwen.py
f4b568f

yangapku committed on

update kvcache
0e3568a

yangapku committed on

update readme
8afa075

yangapku committed on

update model
ff5200f

yangapku committed on