INT8-quantized ONNX version of Felladrin/TinyMistral-248M-Chat-v1, published as Felladrin/onnx-TinyMistral-248M-Chat-v1 for use with Transformers.js.
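A minimal usage sketch, not taken from the model authors. It assumes the `@xenova/transformers` package (Transformers.js v2), whose `text-generation` pipeline loads quantized ONNX weights by default, and it assumes the tokenizer in this repository ships a chat template. `buildMessages` and `chat` are hypothetical helper names introduced only for illustration.

```typescript
// One chat turn in the role/content shape that chat templates expect.
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

// Hypothetical helper: packs a system prompt and a user prompt into a message list.
function buildMessages(system: string, user: string): ChatMessage[] {
  return [
    { role: "system", content: system },
    { role: "user", content: user },
  ];
}

// Hypothetical helper: runs one generation with the quantized ONNX model.
async function chat(
  user: string,
  system: string = "You are a helpful assistant.",
): Promise<string> {
  // Assumption: Transformers.js v2 is installed under this package name.
  // It is imported lazily, so the package is only needed when chat() runs.
  const packageName = "@xenova/transformers";
  const { pipeline } = await import(packageName);

  // Downloads the model on first use (quantized weights are the v2 default).
  const generator = await pipeline(
    "text-generation",
    "Felladrin/onnx-TinyMistral-248M-Chat-v1",
  );

  // Assumption: the tokenizer defines a chat template; render it to a plain string.
  const prompt: string = generator.tokenizer.apply_chat_template(
    buildMessages(system, user),
    { tokenize: false, add_generation_prompt: true },
  );

  const output = await generator(prompt, { max_new_tokens: 64 });
  // The generated text typically includes the prompt followed by the completion.
  return output[0].generated_text;
}
```

Calling `await chat("What is ONNX?")` from an async context should return the generated text. The sampling settings are illustrative, and a model of this size may need tuned generation parameters to give useful replies.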

