AlexKoff88 committed • Commit aa81b56 • 1 Parent(s): 994a4c4
Update README.md
README.md CHANGED
@@ -1,5 +1,12 @@
 ---
 license: apache-2.0
+datasets:
+- mnli
+metrics:
+- accuracy
+tags:
+- sequence-classification
+- int8
 ---
 # Quantized BERT-base MNLI model with 90% unstructured sparsity
 This is the pruned and quantized model in OpenVINO IR format. The pruned model was taken from this [source](https://huggingface.co/neuralmagic/oBERT-12-downstream-pruned-unstructured-90-mnli) and quantized with the code below using HF Optimum for OpenVINO:
@@ -20,7 +27,7 @@ def preprocess_function(examples, tokenizer):
 # Load the default quantization configuration detailing the quantization we wish to apply
 quantization_config = OVConfig()
 # Instantiate our OVQuantizer using the desired configuration
-quantizer = OVQuantizer.from_pretrained(model)
+quantizer = OVQuantizer.from_pretrained(model, feature="sequence-classification")
 # Create the calibration dataset used to perform static quantization

 calibration_dataset = quantizer.get_calibration_dataset(
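The code touched by this commit is only an excerpt of the README's quantization recipe. For context, a minimal end-to-end sketch of that flow, built around the same `OVConfig`, `OVQuantizer.from_pretrained(..., feature="sequence-classification")`, and `get_calibration_dataset` calls shown in the diff, might look like the block below. The model ID, tokenization settings, calibration sample count, and output directory are illustrative assumptions rather than values from the original README, and the `quantize()` call follows the Optimum Intel API of the version this commit appears to target.

```python
from functools import partial

from transformers import AutoModelForSequenceClassification, AutoTokenizer
from optimum.intel.openvino import OVConfig, OVQuantizer

# Assumption: start from the pruned checkpoint referenced in the README.
model_id = "neuralmagic/oBERT-12-downstream-pruned-unstructured-90-mnli"
model = AutoModelForSequenceClassification.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)


def preprocess_function(examples, tokenizer):
    # Tokenize MNLI premise/hypothesis pairs; max_length=128 is an assumed setting.
    return tokenizer(
        examples["premise"],
        examples["hypothesis"],
        padding="max_length",
        max_length=128,
        truncation=True,
    )


# Load the default quantization configuration detailing the quantization we wish to apply
quantization_config = OVConfig()
# Instantiate our OVQuantizer using the desired configuration
quantizer = OVQuantizer.from_pretrained(model, feature="sequence-classification")
# Create the calibration dataset used to perform static quantization
calibration_dataset = quantizer.get_calibration_dataset(
    "glue",
    dataset_config_name="mnli",
    preprocess_function=partial(preprocess_function, tokenizer=tokenizer),
    num_samples=100,  # assumed calibration size
    dataset_split="train",
)
# Apply post-training static quantization and save the INT8 OpenVINO IR
quantizer.quantize(
    quantization_config=quantization_config,
    calibration_dataset=calibration_dataset,
    save_directory="bert-base-mnli-int8-ov",  # assumed output directory
)
```

The `feature="sequence-classification"` argument added in this commit presumably tells the quantizer which task head and input signature to use when exporting the model, matching the `sequence-classification` tag added to the model card metadata. The saved IR could then be loaded for inference with `OVModelForSequenceClassification.from_pretrained(<save_directory>)` from the same package.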