chatglm-6b-int4 / quantization.py

Commit History

Add support for parallel quantization on Mac
f6b88da

zxdu20 committed on

Remove assert in load_cpu_kernel
63d66b0

zxdu20 committed on

Sync with chatglm-6b
f55a108

zxdu20 committed on

Add assertion when loading cpu and cuda kernel fails
630d0ef

songxxzp committed on

Add assertion when loading cpu and cuda kernel fails
bcc35f0

songxxzp committed on

Update CPU kernel loading method
c7d8998

songxxzp committed on

Fix bugs when compiling cpu kernels
68873da

DrSong committed on

Synchronize with chatglm 6b repo
7aaf3fe

DrSong committed on

Fix parallel cpu kernel
7458231

DrSong committed on

Fix bugs in quantization when loading kernels
dac03c3

DrSong committed on

init commit
a93efa9

Sengxian committed on