Intel/xlnet-base-cased-mrpc-int8-static-inc

Text Classification

text-classfication

neural-compressor

Intel® Neural Compressor

PostTrainingStatic


1pikachu1111 committed on Jun 27, 2023

Commit 5bea157 • 1 Parent(s): c731696

update int8 onnx model and readme

Signed-off-by: dujun <jun.du@intel.com>

Files changed (2)

README.md +3 -3
model.onnx +2 -2

README.md CHANGED

@@ -63,14 +63,14 @@ This is an INT8 ONNX model quantized with [Intel® Neural Compressor](https://gi
 The original fp32 model comes from the fine-tuned model [xlnet-base-cased-mrpc](https://huggingface.co/Intel/xlnet-base-cased-mrpc).
-The calibration dataloader is the eval dataloader. The default calibration sampling size 100 isn't divisible exactly by batch size 8. So the real sampling size is 104.
 #### Test result
 |   |INT8|FP32|
 |---|:---:|:---:|
-| **Accuracy (eval-f1)** |0.8935|0.8986|
-| **Model size (MB)**  |286|448|
 #### Load ONNX model:

 The original fp32 model comes from the fine-tuned model [xlnet-base-cased-mrpc](https://huggingface.co/Intel/xlnet-base-cased-mrpc).
+The calibration dataloader is the eval dataloader. The calibration sampling size is 100.
 #### Test result
 |   |INT8|FP32|
 |---|:---:|:---:|
+| **Accuracy (eval-f1)** |0.8974|0.8986|
+| **Model size (MB)**  |226|448|
 #### Load ONNX model:
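
The removed README line says that a requested sampling size of 100 with batch size 8 gave a real sampling size of 104. That figure is consistent with rounding up to a whole number of batches, which the sketch below reproduces. The helper name `effective_sample_count` is hypothetical and is not part of the Intel® Neural Compressor API; the rounding rule is an assumption inferred from the numbers in the diff.

```python
import math


def effective_sample_count(sampling_size: int, batch_size: int) -> int:
    """Round a requested sample count up to a whole number of batches.

    Hypothetical helper: assumes calibration always consumes full batches.
    """
    num_batches = math.ceil(sampling_size / batch_size)
    return num_batches * batch_size


# Values from the removed README line: 100 requested samples, batch size 8.
print(effective_sample_count(100, 8))
```

Under this assumption, a sampling size that is already a multiple of the batch size is left unchanged.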

model.onnx CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5c71602889b26bf3079ca71f4bb721481d24d6f082d9edada98d3dfe41e9454d
-size 299662965

 version https://git-lfs.github.com/spec/v1
+oid sha256:7299d27322dd1ca9fc9b3a98a064eddbc590a2bbf1d254df79e89463432113fd
+size 236931543
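
The Git LFS pointer sizes above can be checked against the INT8 "Model size (MB)" values in the README table. A minimal sketch, assuming the table counts a megabyte as 2^20 bytes; the helper name `bytes_to_mib` is introduced here for illustration only.

```python
BYTES_PER_MIB = 2 ** 20  # assumption: the README's "MB" means 2^20 bytes


def bytes_to_mib(size_in_bytes: int) -> int:
    """Convert a byte count to mebibytes, rounded to the nearest integer."""
    return round(size_in_bytes / BYTES_PER_MIB)


old_size = 299662965  # LFS pointer size before this commit
new_size = 236931543  # LFS pointer size after this commit

print(bytes_to_mib(old_size), bytes_to_mib(new_size))
```

With that convention the two pointer sizes round to 286 and 226, matching the old and new INT8 rows of the table.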