Add/update the quantized ONNX model files and README.md for Transformers.js v3

#17
by whitphx HF Staff - opened

Applied Quantizations

model.onnx

  • int8 (added)
  • uint8 (added)
  • q4 (added)
  • q4f16 (added)
  • bnb4 (added)

README.md is updated

Ready to merge
This branch is ready to get merged automatically.

Sign up or log in to comment