Edit model card

Convert from microsoft/phi-1_5 and BNB 4 bits quantized.

Require onnxruntime>=0.17.0

Downloads last month
2
Inference API (serverless) does not yet support transformers.js models for this pipeline type.

Collection including BricksDisplay/phi-1_5-bnb4