Add/update the quantized ONNX model files and README.md for Transformers.js v3

#3
by whitphx HF Staff - opened

Applied Quantizations

✅ Based on decoder_with_past_model.onnx with slimming

↳ q4f16 (added)

✅ Based on decoder_model.onnx with slimming

↳ q4f16 (added)

✅ Based on encoder_model.onnx with slimming

↳ q4f16 (added)

✅ Based on decoder_model_merged.onnx without slimming

↳ fp16 (replaced because it was invalid)
↳ q4f16 (added)
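
For reviewers checking the diff, here is a minimal sketch of the file names these changes would be expected to produce. It assumes the usual Transformers.js v3 convention of appending a dtype suffix (`_fp16`, `_q4f16`) to the base ONNX file name. The PR description does not list the output file names, so the suffix table and the `quantizedFileName` helper are illustrative assumptions, not part of this PR.

```typescript
// Assumption: dtype -> filename suffix, following the Transformers.js v3 naming convention.
const DTYPE_SUFFIX: Record<string, string> = {
  fp16: "_fp16",
  q4f16: "_q4f16",
};

// Hypothetical helper: derive the quantized file name from a base ONNX file name.
function quantizedFileName(baseFile: string, dtype: string): string {
  const suffix = DTYPE_SUFFIX[dtype];
  if (suffix === undefined) {
    throw new Error(`Unknown dtype: ${dtype}`);
  }
  return baseFile.replace(/\.onnx$/, `${suffix}.onnx`);
}

// Base models and the dtypes this PR reports as added or replaced.
const changes: Array<[string, string[]]> = [
  ["decoder_with_past_model.onnx", ["q4f16"]],
  ["decoder_model.onnx", ["q4f16"]],
  ["encoder_model.onnx", ["q4f16"]],
  ["decoder_model_merged.onnx", ["fp16", "q4f16"]],
];

// Collect the file names a reviewer would expect to see in the diff.
const expectedFiles: string[] = [];
for (const [base, dtypes] of changes) {
  for (const dtype of dtypes) {
    expectedFiles.push(quantizedFileName(base, dtype));
  }
}

console.log(expectedFiles.join("\n"));
```

If the convention holds, a consumer would select these variants by passing `dtype: "q4f16"` (or `"fp16"`) when loading the model with Transformers.js v3.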

