Intel
/

roberta-base-mrpc-int8-static-inc

Text Classification

text-classfication

Intel® Neural Compressor

neural-compressor

PostTrainingStatic

Inference Endpoints

Model card Files Files and versions Community

1pikachu1111 commited on Jun 27, 2023

Commit

1dc7724

·

1 Parent(s): 4843e13

update int8 onnx model and readme

Signed-off-by: dujun <jun.du@intel.com>

Files changed (2) hide show

README.md +4 -2
model.onnx +2 -2

README.md CHANGED Viewed

@@ -61,12 +61,14 @@ This is an INT8 ONNX model quantized with [Intel® Neural Compressor](https://gi
 The original fp32 model comes from the fine-tuned model [roberta-base-mrpc](https://huggingface.co/Intel/roberta-base-mrpc).
 #### Test result
 |   |INT8|FP32|
 |---|:---:|:---:|
-| **Accuracy (eval-f1)** |0.9073|0.9138|
-| **Model size (MB)**  |243|476|
 #### Load ONNX model:

 The original fp32 model comes from the fine-tuned model [roberta-base-mrpc](https://huggingface.co/Intel/roberta-base-mrpc).
+The calibration dataloader is the eval dataloader. The calibration sampling size is 100.
 #### Test result
 |   |INT8|FP32|
 |---|:---:|:---:|
+| **Accuracy (eval-f1)** |0.9100|0.9138|
+| **Model size (MB)**  |294|476|
 #### Load ONNX model:

model.onnx CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:51de3a4577a50af94e72dd8a75d77b9145726d3f83de336a2573fd12180c6075
-size 254669454

 version https://git-lfs.github.com/spec/v1
+oid sha256:315f3dfad2e4344cfc4688634a4909c0505467bb3cec620509ad204af3662cea
+size 307815333