This project contains the ONNX and TensorRT model files converted from the chatglm-6b model. The inference scripts for ONNX and TensorRT will be refined later.
onnx2engine.py converts the ONNX model into a TensorRT engine. The batch size is currently fixed at 1; it can be changed to a dynamic batch size, depending on how much GPU memory you have available.
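A minimal sketch of how a dynamic batch could be set up when building the engine. This is not the contents of onnx2engine.py. It assumes the TensorRT 8.x Python API, an ONNX file exported with a dynamic batch axis, and that the batch is the first and only dynamic dimension of every input. The names `batch_profile` and `build_engine`, the file paths, and the batch limits are hypothetical and chosen for illustration.

```python
def batch_profile(shape, min_batch=1, opt_batch=1, max_batch=4):
    """Return (min, opt, max) shapes that vary only the leading batch dimension."""
    if not (1 <= min_batch <= opt_batch <= max_batch):
        raise ValueError("need 1 <= min_batch <= opt_batch <= max_batch")
    rest = tuple(int(d) for d in shape[1:])
    if any(d < 0 for d in rest):
        # Assumption of this sketch: no other dynamic axes (e.g. sequence length).
        raise ValueError("only the batch dimension may be dynamic in this sketch")
    return tuple((b,) + rest for b in (min_batch, opt_batch, max_batch))


def build_engine(onnx_path, engine_path, min_batch=1, opt_batch=1, max_batch=4):
    """Parse an ONNX file and serialize a TensorRT engine with a batch range."""
    import tensorrt as trt  # imported lazily; requires a TensorRT installation

    logger = trt.Logger(trt.Logger.WARNING)
    builder = trt.Builder(logger)
    flags = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
    network = builder.create_network(flags)
    parser = trt.OnnxParser(network, logger)
    if not parser.parse_from_file(onnx_path):
        errors = [str(parser.get_error(i)) for i in range(parser.num_errors)]
        raise RuntimeError("ONNX parse failed: " + "; ".join(errors))

    config = builder.create_builder_config()
    profile = builder.create_optimization_profile()
    for i in range(network.num_inputs):
        tensor = network.get_input(i)
        shapes = batch_profile(tuple(tensor.shape), min_batch, opt_batch, max_batch)
        profile.set_shape(tensor.name, *shapes)  # min, opt, max
    config.add_optimization_profile(profile)

    serialized = builder.build_serialized_network(network, config)
    if serialized is None:
        raise RuntimeError("engine build failed")
    with open(engine_path, "wb") as f:
        f.write(serialized)


if __name__ == "__main__":
    # Hypothetical paths; a larger max_batch needs more GPU memory.
    build_engine("model.onnx", "model.engine", min_batch=1, opt_batch=2, max_batch=4)
```

The maximum batch is the value to tune against GPU memory: TensorRT sizes its activation buffers for the largest shape in the profile, so a smaller `max_batch` lowers the memory needed at inference time.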