Hugging Face
Models
Datasets
Spaces
Docs
Solutions
Pricing
Log In
Sign Up
THUDM
/
chatglm-6b-int8
like
62
Transformers
PyTorch
Chinese
English
chatglm
glm
thudm
custom_code
Inference Endpoints
Model card
Files
Files and versions
Community
1
Train
Deploy
Use in Transformers
main
chatglm-6b-int8
1 contributor
History:
14 commits
zxdu20
Upload pytorch_model.bin
22906ae
5 months ago
.gitattributes
1.48 kB
initial commit
6 months ago
LICENSE
11.3 kB
Init commit
6 months ago
MODEL_LICENSE
2.35 kB
Init commit
6 months ago
README.md
4.93 kB
Update slack link
5 months ago
config.json
838 Bytes
Init commit
6 months ago
configuration_chatglm.py
4.38 kB
Init commit
6 months ago
ice_text.model
2.71 MB
LFS
Upload ice_text.model
6 months ago
modeling_chatglm.py
59.4 kB
Sync with chatglm-6b
5 months ago
pytorch_model.bin
6.71 GB
LFS
Upload pytorch_model.bin
5 months ago
quantization.py
31 kB
Add support for parallel quantization on Mac
5 months ago
quantization_kernels.c
1.16 kB
Init commit
6 months ago
quantization_kernels_parallel.c
1.61 kB
Init commit
6 months ago
tokenization_chatglm.py
17 kB
Update decode method in tokenizer
5 months ago
tokenizer_config.json
446 Bytes
Init commit
6 months ago