zai-org
/

chatglm-6b

Model card Files Files and versions

Question: edge/mobile deployment — anyone tested?

#112

by 3morixd - opened 6 days ago

We benchmark models on 40 phones (Snapdragon 865) at Dispatch AI (FZE, UAE).

Question: has anyone tested this model on mobile/edge? Interested in:

Inference speed (t/s)
Model size after quantization
RAM usage

Happy to share phone farm benchmark results.

Dispatch AI (FZE), Sharjah UAE

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment