Commit History

Upload optimized CPU ONNX models
882b7ff

kvaishnavi committed on

fix(cpu-int4-rtn-block-32): Fixes "(" typo.
791e509
verified

gugarosa committed on

fix(root): Also fixes the tokenizer.json to prevent mismatches.
e85d1b3

Gustavo de Rosa committed on

fix(tokenizer_config): Adjusts rstrip of special tokens.
1d11596

Gustavo de Rosa committed on

Add files for Hugging Face's Optimum
f194e0a

kvaishnavi committed on

fix(root): Replaces system by user to improve generation experience.
66cbe1f

gugarosa committed on

fix(genai_config): Adds extra EOS token to improve chat outputs.
85f5185

gugarosa committed on

Upload Phi-3-mini-128k-instruct ONNX models
fc3f38a

kvaishnavi committed on