Commit History

Update README
98a75b8

kvaishnavi committed on

Fix version typo
4389eeb

kvaishnavi committed on

Upload optimized CPU ONNX models
882b7ff

kvaishnavi committed on

fix(cpu-int4-rtn-block-32): Fixes "(" typo.
791e509
verified

gugarosa committed on

fix(root): Also fixes the tokenizer.json to prevent mismatches.
e85d1b3

Gustavo de Rosa committed on

fix(tokenizer_config): Adjusts rstrip of special tokens.
1d11596

Gustavo de Rosa committed on

fix(root): Disables inference API since it is not supported for ONNX.
cb44c71
verified

gugarosa committed on

Add files for Hugging Face's Optimum
f194e0a

kvaishnavi committed on

Increase RC version
bcdf0b0

kvaishnavi committed on

fix(root): Replaces system by user to improve generation experience.
66cbe1f

gugarosa committed on

Add config.json for tracking downloads
588e367
verified

kvaishnavi committed on

fix(genai_config): Adds extra EOS token to improve chat outputs.
85f5185

gugarosa committed on

Upload Phi-3-mini-128k-instruct ONNX models
fc3f38a

kvaishnavi committed on

initial commit
213ff61
verified

kvaishnavi committed on