End of training

Files changed:
- README.md +5 -11
- compressed_graph.dot +0 -0
- logs/events.out.tfevents.1700312106.0a848c80699a.1576.0 +3 -0
- logs/events.out.tfevents.1700312559.0a848c80699a.1576.1 +3 -0
- nncf_output.log +188 -0
- openvino_config.json +60 -0
- openvino_model.bin +3 -0
- openvino_model.xml +0 -0
- original_graph.dot +0 -0
- pytorch_model.bin +2 -2
- training_args.bin +1 -1

README.md
CHANGED
@@ -22,7 +22,7 @@ model-index:
 metrics:
 - name: Accuracy
   type: accuracy
-  value: 0.
+  value: 0.908256880733945
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [google/bert_uncased_L-6_H-768_A-12](https://huggingface.co/google/bert_uncased_L-6_H-768_A-12) on the glue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Accuracy: 0.
+- Loss: 0.2574
+- Accuracy: 0.9083
 
 ## Model description
 
@@ -58,20 +58,14 @@ The following hyperparameters were used during training:
 - seed: 33
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs:
+- num_epochs: 1.0
 - mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|
-| 0.
-| 0.1196 | 2.0 | 1054 | 0.2980 | 0.9002 |
-| 0.0789 | 3.0 | 1581 | 0.2825 | 0.9071 |
-| 0.0529 | 4.0 | 2108 | 0.3194 | 0.9071 |
-| 0.0364 | 5.0 | 2635 | 0.3683 | 0.9151 |
-| 0.0236 | 6.0 | 3162 | 0.4103 | 0.9094 |
-| 0.0154 | 7.0 | 3689 | 0.4751 | 0.9083 |
+| 0.244 | 1.0 | 527 | 0.2574 | 0.9083 |
 
 
 ### Framework versions
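
The updated card describes a one-epoch fine-tune of google/bert_uncased_L-6_H-768_A-12 on the glue dataset. A minimal sketch of loading the resulting PyTorch checkpoint with transformers; the repository id "<user>/<repo>" is a placeholder, since this commit does not name one:

```python
# Minimal sketch, assuming the checkpoint is hosted on the Hugging Face Hub;
# "<user>/<repo>" stands in for this repository's actual id.
from transformers import AutoModelForSequenceClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("<user>/<repo>")
model = AutoModelForSequenceClassification.from_pretrained("<user>/<repo>")

# Run one example through the classifier and print the predicted label id.
inputs = tokenizer("a gripping, well-acted drama", return_tensors="pt")
print(model(**inputs).logits.argmax(dim=-1))
```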

compressed_graph.dot
ADDED
The diff for this file is too large to render; see the raw file.

logs/events.out.tfevents.1700312106.0a848c80699a.1576.0
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3c33d221ab0910be5d758fb04c7aebf629f244f1223c6b9112c2c2d89a6e89aa
+size 5069

logs/events.out.tfevents.1700312559.0a848c80699a.1576.1
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:550e6d05743dca4c9e7b90c2cba8bda631811f82e98e62c4ea67e9d7cdc37b45
+size 411

nncf_output.log
ADDED
@@ -0,0 +1,188 @@
+INFO:nncf:Not adding activation input quantizer for operation: 7 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[position_embeddings]/embedding_0
+INFO:nncf:Not adding activation input quantizer for operation: 4 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[word_embeddings]/embedding_0
+INFO:nncf:Not adding activation input quantizer for operation: 5 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[token_type_embeddings]/embedding_0
+INFO:nncf:Not adding activation input quantizer for operation: 6 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 8 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/__iadd___0
+INFO:nncf:Not adding activation input quantizer for operation: 9 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 10 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/Dropout[dropout]/dropout_0
+INFO:nncf:Not adding activation input quantizer for operation: 23 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 26 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 32 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 33 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 37 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 38 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 51 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 54 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 60 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 61 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 65 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 66 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 79 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 82 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 88 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 89 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 93 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 94 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 107 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 110 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 116 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 117 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 121 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 122 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 135 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 138 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 144 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 145 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 149 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 150 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 163 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 166 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 172 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 173 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 177 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 178 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Collecting tensor statistics |█████ | 1 / 3
+INFO:nncf:Collecting tensor statistics |██████████ | 2 / 3
+INFO:nncf:Collecting tensor statistics |████████████████| 3 / 3
+INFO:nncf:Compiling and loading torch extension: quantized_functions_cuda...
+INFO:nncf:Not adding activation input quantizer for operation: 7 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[position_embeddings]/embedding_0
+INFO:nncf:Not adding activation input quantizer for operation: 4 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[word_embeddings]/embedding_0
+INFO:nncf:Not adding activation input quantizer for operation: 5 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFEmbedding[token_type_embeddings]/embedding_0
+INFO:nncf:Not adding activation input quantizer for operation: 6 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 8 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/__iadd___0
+INFO:nncf:Not adding activation input quantizer for operation: 9 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 10 BertForSequenceClassification/BertModel[bert]/BertEmbeddings[embeddings]/Dropout[dropout]/dropout_0
+INFO:nncf:Not adding activation input quantizer for operation: 23 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 26 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 32 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 33 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 37 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 38 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[0]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 51 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 54 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 60 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 61 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 65 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 66 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[1]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 79 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 82 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 88 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 89 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 93 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 94 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[2]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 107 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 110 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 116 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 117 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 121 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 122 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[3]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 135 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 138 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 144 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 145 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 149 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 150 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[4]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 163 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfAttention[self]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 166 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfAttention[self]/matmul_1
+INFO:nncf:Not adding activation input quantizer for operation: 172 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 173 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertAttention[attention]/BertSelfOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Not adding activation input quantizer for operation: 177 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertOutput[output]/__add___0
+INFO:nncf:Not adding activation input quantizer for operation: 178 BertForSequenceClassification/BertModel[bert]/BertEncoder[encoder]/ModuleList[layer]/BertLayer[5]/BertOutput[output]/NNCFLayerNorm[LayerNorm]/layer_norm_0
+INFO:nncf:Collecting tensor statistics |█████ | 1 / 3
+INFO:nncf:Collecting tensor statistics |██████████ | 2 / 3
+INFO:nncf:Collecting tensor statistics |████████████████| 3 / 3
+INFO:nncf:Compiling and loading torch extension: quantized_functions_cuda...
+INFO:nncf:Finished loading torch extension: quantized_functions_cuda
+WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
+NNCF relies on custom-wrapping the `forward` call in order to function properly.
+Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
+If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
+model.nncf.set_original_unbound_forward(fn)
+if `fn` has an unbound 0-th `self` argument, or
+with model.nncf.temporary_bound_original_forward(fn): ...
+if `fn` already had 0-th `self` argument bound or never had it in the first place.
+WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
+NNCF relies on custom-wrapping the `forward` call in order to function properly.
+Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
+If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
+model.nncf.set_original_unbound_forward(fn)
+if `fn` has an unbound 0-th `self` argument, or
+with model.nncf.temporary_bound_original_forward(fn): ...
+if `fn` already had 0-th `self` argument bound or never had it in the first place.
+INFO:nncf:Statistics of the quantization algorithm:
+Epoch 0 |+--------------------------------+-------+
+Epoch 0 || Statistic's name               | Value |
+Epoch 0 |+================================+=======+
+Epoch 0 || Ratio of enabled quantizations | 100   |
+Epoch 0 |+--------------------------------+-------+
+Epoch 0 |
+Epoch 0 |Statistics of the quantization share:
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Statistic's name                 | Value              |
+Epoch 0 |+==================================+====================+
+Epoch 0 || Symmetric WQs / All placed WQs   | 100.00 % (38 / 38) |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Asymmetric WQs / All placed WQs  | 0.00 % (0 / 38)    |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Signed WQs / All placed WQs      | 100.00 % (38 / 38) |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Unsigned WQs / All placed WQs    | 0.00 % (0 / 38)    |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Per-tensor WQs / All placed WQs  | 0.00 % (0 / 38)    |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Per-channel WQs / All placed WQs | 100.00 % (38 / 38) |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Placed WQs / Potential WQs       | 70.37 % (38 / 54)  |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Symmetric AQs / All placed AQs   | 27.27 % (12 / 44)  |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Asymmetric AQs / All placed AQs  | 72.73 % (32 / 44)  |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Signed AQs / All placed AQs      | 100.00 % (44 / 44) |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Unsigned AQs / All placed AQs    | 0.00 % (0 / 44)    |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Per-tensor AQs / All placed AQs  | 100.00 % (44 / 44) |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 || Per-channel AQs / All placed AQs | 0.00 % (0 / 44)    |
+Epoch 0 |+----------------------------------+--------------------+
+Epoch 0 |
+Epoch 0 |Statistics of the bitwidth distribution:
+Epoch 0 |+--------------+---------------------+--------------------+--------------------+
+Epoch 0 || Num bits (N) | N-bits WQs / Placed | N-bits AQs /       | N-bits Qs / Placed |
+Epoch 0 ||              | WQs                 | Placed AQs         | Qs                 |
+Epoch 0 |+==============+=====================+====================+====================+
+Epoch 0 || 8            | 100.00 % (38 / 38)  | 100.00 % (44 / 44) | 100.00 % (82 / 82) |
+Epoch 0 |+--------------+---------------------+--------------------+--------------------+
+WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
+NNCF relies on custom-wrapping the `forward` call in order to function properly.
+Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
+If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
+model.nncf.set_original_unbound_forward(fn)
+if `fn` has an unbound 0-th `self` argument, or
+with model.nncf.temporary_bound_original_forward(fn): ...
+if `fn` already had 0-th `self` argument bound or never had it in the first place.
+WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
+NNCF relies on custom-wrapping the `forward` call in order to function properly.
+Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
+If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
+model.nncf.set_original_unbound_forward(fn)
+if `fn` has an unbound 0-th `self` argument, or
+with model.nncf.temporary_bound_original_forward(fn): ...
+if `fn` already had 0-th `self` argument bound or never had it in the first place.
+WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
+NNCF relies on custom-wrapping the `forward` call in order to function properly.
+Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
+If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
+model.nncf.set_original_unbound_forward(fn)
+if `fn` has an unbound 0-th `self` argument, or
+with model.nncf.temporary_bound_original_forward(fn): ...
+if `fn` already had 0-th `self` argument bound or never had it in the first place.
+WARNING:nncf:You are setting `forward` on an NNCF-processed model object.
+NNCF relies on custom-wrapping the `forward` call in order to function properly.
+Arbitrary adjustments to the forward function on an NNCFNetwork object have undefined behavior.
+If you need to replace the underlying forward function of the original model so that NNCF should be using that instead of the original forward function that NNCF saved during the compressed model creation, you can do this by calling:
+model.nncf.set_original_unbound_forward(fn)
+if `fn` has an unbound 0-th `self` argument, or
+with model.nncf.temporary_bound_original_forward(fn): ...
+if `fn` already had 0-th `self` argument bound or never had it in the first place.
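
The block repeated throughout the log is NNCF's standard warning about reassigning `forward` on a compressed model. A sketch of the two remedies it names, quoted from the log itself; here `model` is assumed to be the NNCF-wrapped module from this run, and `custom_forward` is a hypothetical replacement:

```python
# Both API calls below are quoted verbatim from the NNCF warning above;
# `model` (an NNCF-wrapped torch.nn.Module) and `custom_forward` are assumptions.

def custom_forward(self, *args, **kwargs):
    ...  # hypothetical pre/post-processing around the original forward

# Permanent swap: pass a function whose 0-th `self` argument is unbound.
model.nncf.set_original_unbound_forward(custom_forward)

# Temporary swap: per the log, the context-manager form expects an
# already-bound callable and restores the original forward on exit.
# with model.nncf.temporary_bound_original_forward(bound_fn):
#     ...
```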

openvino_config.json
ADDED
@@ -0,0 +1,60 @@
+{
+  "compression": {
+    "algorithm": "quantization",
+    "export_to_onnx_standard_ops": false,
+    "ignored_scopes": [
+      "{re}.*Embedding.*",
+      "{re}.*add___.*",
+      "{re}.*layer_norm_.*",
+      "{re}.*matmul_1",
+      "{re}.*__truediv__.*"
+    ],
+    "initializer": {
+      "batchnorm_adaptation": {
+        "num_bn_adaptation_samples": 0
+      },
+      "range": {
+        "num_init_samples": 300,
+        "type": "mean_min_max"
+      }
+    },
+    "overflow_fix": "disable",
+    "preset": "mixed",
+    "scope_overrides": {
+      "activations": {
+        "{re}.*matmul_0": {
+          "mode": "symmetric"
+        }
+      }
+    }
+  },
+  "input_info": [
+    {
+      "keyword": "input_ids",
+      "sample_size": [
+        128,
+        66
+      ],
+      "type": "long"
+    },
+    {
+      "keyword": "token_type_ids",
+      "sample_size": [
+        128,
+        66
+      ],
+      "type": "long"
+    },
+    {
+      "keyword": "attention_mask",
+      "sample_size": [
+        128,
+        66
+      ],
+      "type": "long"
+    }
+  ],
+  "optimum_version": "1.14.1",
+  "save_onnx_model": false,
+  "transformers_version": "4.35.2"
+}
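
The config records the NNCF quantization settings applied during this run by optimum-intel 1.14.1: a mixed symmetric/asymmetric preset, mean-min-max range initialization over 300 samples, and embeddings, residual adds, LayerNorms, the attention-probability matmul, and `__truediv__` calls excluded via ignored_scopes (the bitwidth table in the log above shows the remaining quantizers are all 8-bit). A hedged sketch of loading the exported OpenVINO IR with optimum-intel; the repository id is again a placeholder:

```python
# Sketch, assuming optimum-intel (with its OpenVINO extra) is installed;
# "<user>/<repo>" is a placeholder repository id.
from optimum.intel import OVModelForSequenceClassification
from transformers import AutoTokenizer

ov_model = OVModelForSequenceClassification.from_pretrained("<user>/<repo>")
tokenizer = AutoTokenizer.from_pretrained("<user>/<repo>")

# Inference runs on the quantized openvino_model.{xml,bin} pair.
inputs = tokenizer("a gripping, well-acted drama", return_tensors="pt")
print(ov_model(**inputs).logits.argmax(dim=-1))
```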

openvino_model.bin
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:14e7a63cb6ed4e71f0fb41442750168d21fe4cc1f4a932ab908a126c9ffc3058
+size 138739212

openvino_model.xml
ADDED
The diff for this file is too large to render; see the raw file.

original_graph.dot
ADDED
The diff for this file is too large to render; see the raw file.

pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:b6286ab040d463864d3a8e9d171df02ea044a878ecd16646b9aef131ecbea578
+size 268172814
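
The `.bin` diffs above are Git LFS pointer files: the repository tracks only an `oid` (the SHA-256 of the blob) and its `size`, per the git-lfs spec v1 named in the pointer. A small check that a downloaded blob matches the new pointer in this commit:

```python
# Verify a downloaded pytorch_model.bin against the LFS pointer above.
import hashlib

EXPECTED_OID = "b6286ab040d463864d3a8e9d171df02ea044a878ecd16646b9aef131ecbea578"
EXPECTED_SIZE = 268172814  # bytes, from the pointer's `size` line

h = hashlib.sha256()
size = 0
with open("pytorch_model.bin", "rb") as f:
    # Hash in 1 MiB chunks to avoid loading the whole blob into memory.
    for chunk in iter(lambda: f.read(1 << 20), b""):
        h.update(chunk)
        size += len(chunk)

assert size == EXPECTED_SIZE, f"size mismatch: {size} != {EXPECTED_SIZE}"
assert h.hexdigest() == EXPECTED_OID, "sha256 mismatch"
print("pointer verified")
```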

training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:77fea7f47d6f91af7370043dfc2a8acdf21fdd576a23651c0002ed739a6042b2
 size 4600