Phi-3-small-8k-instruct-onnx-cuda / cuda-int4-rtn-block-32

Commit History

Upload Phi-3-small-8k-instruct ONNX models
bee0906

kvaishnavi commited on