Instructions to use zai-org/chatglm-6b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use zai-org/chatglm-6b with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("zai-org/chatglm-6b", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Question: edge/mobile deployment β anyone tested?
#112
by 3morixd - opened
We benchmark models on 40 phones (Snapdragon 865) at Dispatch AI (FZE, UAE).
Question: has anyone tested this model on mobile/edge? Interested in:
- Inference speed (t/s)
- Model size after quantization
- RAM usage
Happy to share phone farm benchmark results.
- Dispatch AI (FZE), Sharjah UAE