Model Optimizer arguments:
Common parameters:
    - Path to the Input Model: /data1/vchua/tld-poc/repo/bert-base-squadv1-pruneofa-90pc-bt-qat-lt/bert-base-squadv1-pruneofa-90pc-bt-qat-lt.onnx
    - Path for generated IR: /data1/vchua/tld-poc/repo/bert-base-squadv1-pruneofa-90pc-bt-qat-lt/ir
    - IR output name: bert-base-squadv1-pruneofa-90pc-bt-qat-lt
    - Log level: ERROR
    - Batch: Not specified, inherited from the model
    - Input layers: Not specified, inherited from the model
    - Output layers: Not specified, inherited from the model
    - Input shapes: Not specified, inherited from the model
    - Mean values: Not specified
    - Scale values: Not specified
    - Scale factor: Not specified
    - Precision of IR: FP32
    - Enable fusing: True
    - Enable grouped convolutions fusing: True
    - Move mean values to preprocess section: None
    - Reverse input channels: False
ONNX specific parameters:
    - Inference Engine found in: /opt/intel/openvino/python/python3.6/openvino
Inference Engine version: 2021.4.2-3974-e2a469a3450-releases/2021/4
Model Optimizer version: 2021.4.2-3974-e2a469a3450-releases/2021/4
[ WARNING ] Convert data type of Parameter "input_ids" to int32
[ WARNING ] Convert data type of Parameter "attention_mask.1" to int32
[ WARNING ] Convert data type of Parameter "input.1" to int32
[ SUCCESS ] Generated IR version 10 model.
[ SUCCESS ] XML file: /data1/vchua/tld-poc/repo/bert-base-squadv1-pruneofa-90pc-bt-qat-lt/ir/bert-base-squadv1-pruneofa-90pc-bt-qat-lt.xml
[ SUCCESS ] BIN file: /data1/vchua/tld-poc/repo/bert-base-squadv1-pruneofa-90pc-bt-qat-lt/ir/bert-base-squadv1-pruneofa-90pc-bt-qat-lt.bin
[ SUCCESS ] Total execution time: 98.21 seconds.
[ SUCCESS ] Memory consumed: 2750 MB.