fp16 KV I/O for q4 model
- onnx/model_q4f16.onnx +2 -2
onnx/model_q4f16.onnx
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:d703e1223828610da52dfb6e35f38bc50e7b6f4a7f270c91b170f02088112623
+size 1240688216
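The file changed here is a Git LFS pointer, not the model weights themselves: three `key value` lines giving the spec version, the object id (a SHA-256 digest), and the size in bytes. As a minimal sketch, assuming the new `onnx/model_q4f16.onnx` has been downloaded locally, a file can be checked against the `+` side of the diff like this. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a large file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as f:
        # Hash in 1 MiB chunks so the whole file is never held in memory.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the "+" side of the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:d703e1223828610da52dfb6e35f38bc50e7b6f4a7f270c91b170f02088112623
size 1240688216
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

With the file on disk, `matches_pointer("onnx/model_q4f16.onnx", pointer)` would report whether the local copy is the object this commit points to; the path is an assumption about where the file was saved.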