fp16 KV I/O for q4 model

#3
Opened by Xenova (HF staff)
Files changed (1)
  1. onnx/model_q4f16.onnx +2 -2
onnx/model_q4f16.onnx CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:dd9589b8d4724727ebe574e56dcb7e13dce84533de953a5b830c8708adeca854
- size 1240708766
+ oid sha256:d703e1223828610da52dfb6e35f38bc50e7b6f4a7f270c91b170f02088112623
+ size 1240688216
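The changed file is a Git LFS pointer, so the diff only records the SHA-256 digest and byte size of the new ONNX weights. As a minimal sketch, a downloaded copy could be checked against the new pointer roughly as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of this PR or of any Hugging Face tooling, and the local path in the usage comment is an assumption.

```python
import hashlib
from pathlib import Path

# New pointer contents, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d703e1223828610da52dfb6e35f38bc50e7b6f4a7f270c91b170f02088112623
size 1240688216
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(path: Path, pointer_text: str, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and SHA-256 named in the pointer."""
    fields = parse_lfs_pointer(pointer_text)
    algo, _, expected_hex = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo!r}")

    # Cheap check first: compare byte sizes before hashing a large file.
    if path.stat().st_size != int(fields["size"]):
        return False

    # Hash in chunks so the whole model is never held in memory at once.
    digest = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected_hex


# Usage (assumes the model was downloaded to this relative path):
# matches_pointer(Path("onnx/model_q4f16.onnx"), NEW_POINTER)
```

Comparing the two `size` fields, the new file is 1240708766 - 1240688216 = 20550 bytes smaller than the old one.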