Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
THUDM
/
chatglm-6b-int8
like
67
Transformers
PyTorch
Chinese
English
chatglm
glm
thudm
custom_code
Inference Endpoints
Model card
Files
Files and versions
Community
1
Train
Deploy
Use in Transformers
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
推理相比量化前更慢了
2
#1 opened 11 months ago by
luobotaxinghu