Phi-3-mini-128k-instruct-onnx / cpu_and_mobile /cpu-int4-rtn-block-32-acc-level-4

Commit History

Upload optimized CPU ONNX models
882b7ff

kvaishnavi commited on

fix(root): Also fixes the tokenizer.json to prevent mismatches.
e85d1b3

Gustavo de Rosa commited on

fix(tokenizer_config): Adjusts rstrip of special tokens.
1d11596

Gustavo de Rosa commited on

Add files for Hugging Face's Optimum
f194e0a

kvaishnavi commited on

fix(root): Replaces system by user to improve generation experience.
66cbe1f

gugarosa commited on

fix(genai_config): Adds extra EOS token to improve chat outputs.
85f5185

gugarosa commited on

Upload Phi-3-mini-128k-instruct ONNX models
fc3f38a

kvaishnavi commited on