How can I load a quantized model from my own host?
by yoeldcd
I am trying to load smollm-360M-Instruct with q4 quantization. I specified dtype as 'q4' in the options object, but the pipeline shows an error that smollm-360M-Instruct/onnx/model_merged_quantized.onnx was not found.
I have just configured my own host, but the pipeline is not reading the correct quantized ONNX file (model_q4.onnx).
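For context, this is roughly what my setup looks like (the host URL is a placeholder for my own server, and I am assuming the @huggingface/transformers v3 package, since as far as I know that is the version that supports the dtype option):

```js
import { env, pipeline } from '@huggingface/transformers';

// Point the library at my self-hosted mirror instead of huggingface.co
// (placeholder URL).
env.remoteHost = 'https://my-model-host.example.com/';
// Path layout on the host; this is the library's default template.
env.remotePathTemplate = '{model}/resolve/{revision}/';

// Request the 4-bit quantized weights; I expect this to resolve
// onnx/model_q4.onnx from my host.
const generator = await pipeline(
  'text-generation',
  'smollm-360M-Instruct',
  { dtype: 'q4' }
);

const output = await generator('Hello, ', { max_new_tokens: 32 });
console.log(output);
```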