THUDM/chatglm-6b-int4
Tags: Transformers, PyTorch, Chinese, English, chatglm, glm, thudm, custom_code, Inference Endpoints, arxiv:2406.12793
Branch: main
6 contributors
History: 39 commits
Latest commit: yuxiaod, "Update README.md" (826ca34, verified), 2 months ago
File                             Size                   Last commit message                           Updated
.gitattributes                   1.48 kB                initial commit                                over 1 year ago
LICENSE                          11.3 kB                init commmit                                  over 1 year ago
MODEL_LICENSE                    4.27 kB                Update license                                about 1 year ago
README.md                        5.61 kB                Update README.md                              2 months ago
config.json                      838 Bytes              Sync with chatglm-6b                          over 1 year ago
configuration_chatglm.py         4.38 kB                Slim embedding                                over 1 year ago
ice_text.model                   2.71 MB (LFS)          Drop icetk dependency                         over 1 year ago
modeling_chatglm.py              59.4 kB                Sync with chatglm-6b                          over 1 year ago
pytorch_model.bin                3.89 GB (LFS, pickle)  Upload pytorch_model.bin                      over 1 year ago
quantization.py                  31 kB                  Add support for parallel quantization on Mac  over 1 year ago
quantization_kernels.c           1.16 kB                init commmit                                  over 1 year ago
quantization_kernels_parallel.c  1.61 kB                Fix parallel cpu kernel                       over 1 year ago
tokenization_chatglm.py          17 kB                  Update decode method in tokenizer             over 1 year ago
tokenizer_config.json            446 Bytes              Fix eos token in tokenizer                    over 1 year ago

Detected pickle imports in pytorch_model.bin (4): "torch._utils._rebuild_tensor_v2", "torch.HalfStorage", "torch.CharStorage", "collections.OrderedDict"
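
The pickle imports reported for pytorch_model.bin are the module-level names that a pickle stream asks Python to import when it is loaded. As a rough illustration of how such a list could be derived, the sketch below walks the opcodes of an in-memory pickle with the standard-library pickletools module and collects the referenced globals. The helper name list_pickle_imports is hypothetical, the handling of STACK_GLOBAL is a simplifying heuristic, and this is not the scanner the Hub itself uses. A real checkpoint file wraps its pickle stream in a container format, so the sketch scans a small stand-in object instead.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the "module.name" globals a pickle stream references.

    Hypothetical helper for illustration only. It inspects opcodes
    without unpickling, so no code from the stream is executed.
    """
    imports = set()
    strings = []  # string arguments seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Protocol 4+: module and name are taken from the stack.
            # Heuristic (assumption): treat the two most recently pushed
            # strings as module and name; memoized strings are not tracked.
            imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


if __name__ == "__main__":
    # Stand-in payload: an OrderedDict, one of the four imports listed above.
    payload = pickle.dumps(collections.OrderedDict(a=1), protocol=2)
    print(sorted(list_pickle_imports(payload)))
```

Reading opcodes rather than calling pickle.load is the point of the design: the stream is only inspected, so a name outside an expected allow-list can be flagged before anything is imported.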