nougat-small-onnx-quant_avx512_vnni

This was quantized from pszemraj/nougat-small-onnx using the --avx512_vnni flag. You need to have a processor with avx512_vnni instructions for this to work properly.

Downloads last month
4
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Collection including pszemraj/nougat-small-onnx-quant_avx512_vnni