add experience

Browse files

Files changed (9) hide show

README.md +156 -0
runs.json +1728 -0
tensorboard/1657613137.713749/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.1 +3 -0
tensorboard/1657613137.7153692/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.2 +3 -0
tensorboard/1657613137.7166162/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.3 +3 -0
tensorboard/1657613137.7177505/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.4 +3 -0
tensorboard/1657613137.7188103/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.5 +3 -0
tensorboard/1657613137.7203975/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.6 +3 -0
tensorboard/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.0 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,156 @@

+---
+pipeline_tag: image-classification
+datasets:
+- beans
+metrics:
+- accuracy
+tags:
+- vit
+---
+**task**: `image-classification`
+**Backend:** `sagemaker-training`
+**Backend args:** `{'instance_type': 'ml.g4dn.2xlarge', 'supported_instructions': None}`
+**Number of evaluation samples:** `All dataset`
+Fixed parameters:
+* **model_name_or_path**: `nateraw/vit-base-beans`
+* **dataset**:
+    * **path**: `beans`
+    * **eval_split**: `validation`
+    * **data_keys**: `{'primary': 'image'}`
+    * **ref_keys**: `['labels']`
+* **quantization_approach**: `dynamic`
+* **node_exclusion**: `[]`
+* **framework**: `onnxruntime`
+* **framework_args**:
+    * **opset**: `11`
+    * **optimization_level**: `1`
+* **aware_training**: `False`
+Benchmarked parameters:
+* **operators_to_quantize**: `['Add', 'MatMul']`,  `['Add']`,  `[]`
+* **per_channel**: `False`,  `True`
+# Evaluation
+## Non-time metrics
+| operators_to_quantize | per_channel |     | accuracy (original) | accuracy (optimized) |
+| :-------------------: | :---------: | :-: | :-----------------: | :------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |        0.980        |        0.980         |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |        0.980        |        0.980         |
+|       `['Add']`       |   `False`   |  \|  |        0.980        |        0.980         |
+|       `['Add']`       |   `True`    |  \|  |        0.980        |        0.980         |
+|         `[]`          |   `False`   |  \|  |        0.980        |        0.980         |
+|         `[]`          |   `True`    |  \|  |        0.980        |        0.980         |
+## Time metrics
+Time benchmarks were run for 15 seconds per config.
+Below, time metrics for batch size = 1, input length = 32.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           201.25            |            70.30             |  \|  |           5.00            |           14.27            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           203.52            |            72.48             |  \|  |           4.93            |           13.80            |
+|       `['Add']`       |   `False`   |  \|  |           166.03            |            150.93            |  \|  |           6.07            |            6.67            |
+|       `['Add']`       |   `True`    |  \|  |           200.82            |            163.17            |  \|  |           5.00            |            6.13            |
+|         `[]`          |   `False`   |  \|  |           190.99            |            162.06            |  \|  |           5.27            |            6.20            |
+|         `[]`          |   `True`    |  \|  |           155.15            |            162.52            |  \|  |           6.47            |            6.20            |
+Below, time metrics for batch size = 1, input length = 64.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           165.85            |            70.60             |  \|  |           6.07            |           14.20            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           161.41            |            72.71             |  \|  |           6.20            |           13.80            |
+|       `['Add']`       |   `False`   |  \|  |           200.45            |            129.40            |  \|  |           5.00            |            7.73            |
+|       `['Add']`       |   `True`    |  \|  |           154.68            |            136.42            |  \|  |           6.47            |            7.40            |
+|         `[]`          |   `False`   |  \|  |           166.97            |            162.15            |  \|  |           6.00            |            6.20            |
+|         `[]`          |   `True`    |  \|  |           166.32            |            162.81            |  \|  |           6.07            |            6.20            |
+Below, time metrics for batch size = 1, input length = 128.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           199.48            |            70.98             |  \|  |           5.07            |           14.13            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           199.65            |            71.78             |  \|  |           5.07            |           13.93            |
+|       `['Add']`       |   `False`   |  \|  |           199.08            |            137.97            |  \|  |           5.07            |            7.27            |
+|       `['Add']`       |   `True`    |  \|  |           189.93            |            162.45            |  \|  |           5.33            |            6.20            |
+|         `[]`          |   `False`   |  \|  |           191.63            |            162.54            |  \|  |           5.27            |            6.20            |
+|         `[]`          |   `True`    |  \|  |           200.38            |            162.55            |  \|  |           5.00            |            6.20            |
+Below, time metrics for batch size = 4, input length = 32.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           655.84            |            243.33            |  \|  |           1.53            |            4.13            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           661.27            |            221.16            |  \|  |           1.53            |            4.53            |
+|       `['Add']`       |   `False`   |  \|  |           662.84            |            529.28            |  \|  |           1.53            |            1.93            |
+|       `['Add']`       |   `True`    |  \|  |           512.47            |            470.66            |  \|  |           2.00            |            2.13            |
+|         `[]`          |   `False`   |  \|  |           562.81            |            501.77            |  \|  |           1.80            |            2.00            |
+|         `[]`          |   `True`    |  \|  |           505.81            |            521.20            |  \|  |           2.00            |            1.93            |
+Below, time metrics for batch size = 4, input length = 64.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           654.58            |            258.54            |  \|  |           1.53            |            3.93            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           617.44            |            234.05            |  \|  |           1.67            |            4.33            |
+|       `['Add']`       |   `False`   |  \|  |           661.51            |            478.81            |  \|  |           1.53            |            2.13            |
+|       `['Add']`       |   `True`    |  \|  |           657.01            |            660.23            |  \|  |           1.53            |            1.53            |
+|         `[]`          |   `False`   |  \|  |           661.64            |            474.28            |  \|  |           1.53            |            2.13            |
+|         `[]`          |   `True`    |  \|  |           661.29            |            471.09            |  \|  |           1.53            |            2.13            |
+Below, time metrics for batch size = 4, input length = 128.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           654.80            |            219.38            |  \|  |           1.53            |            4.60            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           663.50            |            222.37            |  \|  |           1.53            |            4.53            |
+|       `['Add']`       |   `False`   |  \|  |           625.56            |            529.02            |  \|  |           1.60            |            1.93            |
+|       `['Add']`       |   `True`    |  \|  |           655.08            |            499.41            |  \|  |           1.53            |            2.07            |
+|         `[]`          |   `False`   |  \|  |           655.92            |            473.01            |  \|  |           1.53            |            2.13            |
+|         `[]`          |   `True`    |  \|  |           505.54            |            659.92            |  \|  |           2.00            |            1.53            |
+Below, time metrics for batch size = 8, input length = 32.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           968.83            |            443.80            |  \|  |           1.07            |            2.27            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           1255.70           |            489.55            |  \|  |           0.80            |            2.07            |
+|       `['Add']`       |   `False`   |  \|  |           1301.35           |            938.14            |  \|  |           0.80            |            1.07            |
+|       `['Add']`       |   `True`    |  \|  |           1279.54           |            931.91            |  \|  |           0.80            |            1.13            |
+|         `[]`          |   `False`   |  \|  |           1292.66           |           1318.07            |  \|  |           0.80            |            0.80            |
+|         `[]`          |   `True`    |  \|  |           1290.35           |           1314.74            |  \|  |           0.80            |            0.80            |
+Below, time metrics for batch size = 8, input length = 64.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           1305.45           |            438.06            |  \|  |           0.80            |            2.33            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           1296.68           |            450.40            |  \|  |           0.80            |            2.27            |
+|       `['Add']`       |   `False`   |  \|  |           968.21            |            949.81            |  \|  |           1.07            |            1.07            |
+|       `['Add']`       |   `True`    |  \|  |           1012.35           |           1317.46            |  \|  |           1.00            |            0.80            |
+|         `[]`          |   `False`   |  \|  |           1213.91           |            961.79            |  \|  |           0.87            |            1.07            |
+|         `[]`          |   `True`    |  \|  |           956.39            |            945.41            |  \|  |           1.07            |            1.07            |
+Below, time metrics for batch size = 8, input length = 128.
+| operators_to_quantize | per_channel |     | latency_mean (original, ms) | latency_mean (optimized, ms) |     | throughput (original, /s) | throughput (optimized, /s) |
+| :-------------------: | :---------: | :-: | :-------------------------: | :--------------------------: | :-: | :-----------------------: | :------------------------: |
+|  `['Add', 'MatMul']`  |   `False`   |  \|  |           1120.12           |            497.17            |  \|  |           0.93            |            2.07            |
+|  `['Add', 'MatMul']`  |   `True`    |  \|  |           1289.50           |            443.46            |  \|  |           0.80            |            2.27            |
+|       `['Add']`       |   `False`   |  \|  |           1294.65           |            930.97            |  \|  |           0.80            |            1.13            |
+|       `['Add']`       |   `True`    |  \|  |           1181.21           |            933.82            |  \|  |           0.87            |            1.13            |
+|         `[]`          |   `False`   |  \|  |           1245.61           |           1318.07            |  \|  |           0.87            |            0.80            |
+|         `[]`          |   `True`    |  \|  |           1285.81           |           1318.82            |  \|  |           0.80            |            0.80            |

runs.json ADDED Viewed

	@@ -0,0 +1,1728 @@

+[
+    {
+        "model_name_or_path": "nateraw/vit-base-beans",
+        "task": "image-classification",
+        "dataset": {
+            "path": "beans",
+            "eval_split": "validation",
+            "data_keys": {
+                "primary": "image",
+                "secondary": null
+            },
+            "ref_keys": [
+                "labels"
+            ],
+            "name": null,
+            "calibration_split": null
+        },
+        "quantization_approach": "dynamic",
+        "operators_to_quantize": [],
+        "node_exclusion": [],
+        "aware_training": false,
+        "per_channel": false,
+        "calibration": null,
+        "framework": "onnxruntime",
+        "framework_args": {
+            "opset": 11,
+            "optimization_level": 1
+        },
+        "hardware": "Architecture:                    x86_64\nCPU op-mode(s):                  32-bit, 64-bit\nByte Order:                      Little Endian\nAddress sizes:                   46 bits physical, 48 bits virtual\nCPU(s):                          8\nOn-line CPU(s) list:             0-7\nThread(s) per core:              2\nCore(s) per socket:              4\nSocket(s):                       1\nNUMA node(s):                    1\nVendor ID:                       GenuineIntel\nCPU family:                      6\nModel:                           85\nModel name:                      Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz\nStepping:                        7\nCPU MHz:                         3100.973\nBogoMIPS:                        4999.99\nHypervisor vendor:               KVM\nVirtualization type:             full\nL1d cache:                       128 KiB\nL1i cache:                       128 KiB\nL2 cache:                        4 MiB\nL3 cache:                        35.8 MiB\nNUMA node0 CPU(s):               0-7\nVulnerability Itlb multihit:     KVM: Vulnerable\nVulnerability L1tf:              Mitigation; PTE Inversion\nVulnerability Mds:               Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown\nVulnerability Meltdown:          Mitigation; PTI\nVulnerability Spec store bypass: Vulnerable\nVulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization\nVulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling\nVulnerability Srbds:             Not affected\nVulnerability Tsx async abort:   Not affected\nFlags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni\n",
+        "versions": {
+            "transformers": "4.20.1",
+            "optimum": "1.2.3.dev0",
+            "optimum_hash": "5ac9c0d9fd7e7cca55b2f9935b961ed5b6c50112"
+        },
+        "evaluation": {
+            "time": [
+                {
+                    "batch_size": 8,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1292.6622040833333,
+                        "latency_std": 14.378985581543033,
+                        "latency_50": 1292.895404,
+                        "latency_90": 1309.2992319,
+                        "latency_95": 1312.26437405,
+                        "latency_99": 1315.14182841,
+                        "latency_999": 1315.789255641
+                    },
+                    "optimized": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1318.07484525,
+                        "latency_std": 2.1649871576445947,
+                        "latency_50": 1317.9739275,
+                        "latency_90": 1321.6588582,
+                        "latency_95": 1322.10577285,
+                        "latency_99": 1322.27261777,
+                        "latency_999": 1322.3101578770002
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 13,
+                        "throughput": 0.87,
+                        "latency_mean": 1213.9143344615386,
+                        "latency_std": 110.1812092312708,
+                        "latency_50": 1259.642669,
+                        "latency_90": 1280.0082366,
+                        "latency_95": 1282.0939434000002,
+                        "latency_99": 1283.78822388,
+                        "latency_999": 1284.169436988
+                    },
+                    "optimized": {
+                        "nb_forwards": 16,
+                        "throughput": 1.07,
+                        "latency_mean": 961.791823,
+                        "latency_std": 104.26168200680283,
+                        "latency_50": 928.5080535,
+                        "latency_90": 1027.710494,
+                        "latency_95": 1167.36817775,
+                        "latency_99": 1291.45591475,
+                        "latency_999": 1319.3756555749999
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 90,
+                        "throughput": 6.0,
+                        "latency_mean": 166.9657096111111,
+                        "latency_std": 19.234928655802502,
+                        "latency_50": 155.146231,
+                        "latency_90": 197.2404852,
+                        "latency_95": 197.6184337,
+                        "latency_99": 198.50318102,
+                        "latency_999": 199.948491002
+                    },
+                    "optimized": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 162.14788376344086,
+                        "latency_std": 0.6774441784666013,
+                        "latency_50": 162.056419,
+                        "latency_90": 162.494086,
+                        "latency_95": 162.57127219999998,
+                        "latency_99": 163.26029604,
+                        "latency_999": 167.65193600400002
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 13,
+                        "throughput": 0.87,
+                        "latency_mean": 1245.6068604615386,
+                        "latency_std": 77.66408642086917,
+                        "latency_50": 1272.574299,
+                        "latency_90": 1274.933101,
+                        "latency_95": 1275.4852415999999,
+                        "latency_99": 1275.65235792,
+                        "latency_999": 1275.689959092
+                    },
+                    "optimized": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1318.074927,
+                        "latency_std": 2.5544177471077765,
+                        "latency_50": 1317.6566105,
+                        "latency_90": 1321.9495818,
+                        "latency_95": 1322.64612275,
+                        "latency_99": 1323.13381655,
+                        "latency_999": 1323.243547655
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 27,
+                        "throughput": 1.8,
+                        "latency_mean": 562.8057038518518,
+                        "latency_std": 71.38555755484205,
+                        "latency_50": 509.265269,
+                        "latency_90": 654.6393252,
+                        "latency_95": 654.9679175,
+                        "latency_99": 656.29719274,
+                        "latency_999": 656.685855274
+                    },
+                    "optimized": {
+                        "nb_forwards": 30,
+                        "throughput": 2.0,
+                        "latency_mean": 501.7736329333333,
+                        "latency_std": 55.80762294417158,
+                        "latency_50": 480.2483315,
+                        "latency_90": 542.9033260000002,
+                        "latency_95": 658.5849407000001,
+                        "latency_99": 663.27316083,
+                        "latency_999": 664.471430883
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 79,
+                        "throughput": 5.27,
+                        "latency_mean": 191.62850325316455,
+                        "latency_std": 17.746003780554855,
+                        "latency_50": 199.661154,
+                        "latency_90": 203.1430258,
+                        "latency_95": 204.9078804,
+                        "latency_99": 208.3248753,
+                        "latency_999": 208.39238312999998
+                    },
+                    "optimized": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 162.54266559139785,
+                        "latency_std": 0.5169025249532633,
+                        "latency_50": 162.432963,
+                        "latency_90": 163.1380686,
+                        "latency_95": 163.295702,
+                        "latency_99": 164.58895547999998,
+                        "latency_999": 164.993086548
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 661.6382227391304,
+                        "latency_std": 0.8913331138378804,
+                        "latency_50": 661.329055,
+                        "latency_90": 662.309612,
+                        "latency_95": 663.6607806000001,
+                        "latency_99": 664.23226672,
+                        "latency_999": 664.339438972
+                    },
+                    "optimized": {
+                        "nb_forwards": 32,
+                        "throughput": 2.13,
+                        "latency_mean": 474.27704703125,
+                        "latency_std": 8.662769183818762,
+                        "latency_50": 474.1899635,
+                        "latency_90": 485.4294334,
+                        "latency_95": 485.85774355,
+                        "latency_99": 487.66797085,
+                        "latency_999": 488.291721385
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 655.924965826087,
+                        "latency_std": 3.296725754580373,
+                        "latency_50": 654.734062,
+                        "latency_90": 660.6470922000001,
+                        "latency_95": 662.5686226,
+                        "latency_99": 664.9523018,
+                        "latency_999": 665.51169338
+                    },
+                    "optimized": {
+                        "nb_forwards": 32,
+                        "throughput": 2.13,
+                        "latency_mean": 473.009812875,
+                        "latency_std": 8.845356653932509,
+                        "latency_50": 471.9876055,
+                        "latency_90": 482.61548319999997,
+                        "latency_95": 485.2878379,
+                        "latency_99": 488.35063118,
+                        "latency_999": 489.241902818
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 79,
+                        "throughput": 5.27,
+                        "latency_mean": 190.98778439240508,
+                        "latency_std": 17.30226702851961,
+                        "latency_50": 199.368468,
+                        "latency_90": 200.088627,
+                        "latency_95": 200.4188227,
+                        "latency_99": 200.62923102000002,
+                        "latency_999": 200.790088002
+                    },
+                    "optimized": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 162.05550177419354,
+                        "latency_std": 0.2734454225531516,
+                        "latency_50": 162.042985,
+                        "latency_90": 162.41303159999998,
+                        "latency_95": 162.54073480000002,
+                        "latency_99": 162.64262688,
+                        "latency_999": 162.750607188
+                    }
+                }
+            ],
+            "others": {
+                "baseline": {
+                    "accuracy": 0.98
+                },
+                "optimized": {
+                    "accuracy": 0.98
+                }
+            }
+        },
+        "max_eval_samples": null,
+        "time_benchmark_args": {
+            "duration": 15,
+            "warmup_runs": 5
+        },
+        "model_type": "vit"
+    },
+    {
+        "model_name_or_path": "nateraw/vit-base-beans",
+        "task": "image-classification",
+        "dataset": {
+            "path": "beans",
+            "eval_split": "validation",
+            "data_keys": {
+                "primary": "image",
+                "secondary": null
+            },
+            "ref_keys": [
+                "labels"
+            ],
+            "name": null,
+            "calibration_split": null
+        },
+        "quantization_approach": "dynamic",
+        "operators_to_quantize": [
+            "Add",
+            "MatMul"
+        ],
+        "node_exclusion": [],
+        "aware_training": false,
+        "per_channel": false,
+        "calibration": null,
+        "framework": "onnxruntime",
+        "framework_args": {
+            "opset": 11,
+            "optimization_level": 1
+        },
+        "hardware": "Architecture:                    x86_64\nCPU op-mode(s):                  32-bit, 64-bit\nByte Order:                      Little Endian\nAddress sizes:                   46 bits physical, 48 bits virtual\nCPU(s):                          8\nOn-line CPU(s) list:             0-7\nThread(s) per core:              2\nCore(s) per socket:              4\nSocket(s):                       1\nNUMA node(s):                    1\nVendor ID:                       GenuineIntel\nCPU family:                      6\nModel:                           85\nModel name:                      Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz\nStepping:                        7\nCPU MHz:                         3102.494\nBogoMIPS:                        4999.99\nHypervisor vendor:               KVM\nVirtualization type:             full\nL1d cache:                       128 KiB\nL1i cache:                       128 KiB\nL2 cache:                        4 MiB\nL3 cache:                        35.8 MiB\nNUMA node0 CPU(s):               0-7\nVulnerability Itlb multihit:     KVM: Vulnerable\nVulnerability L1tf:              Mitigation; PTE Inversion\nVulnerability Mds:               Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown\nVulnerability Meltdown:          Mitigation; PTI\nVulnerability Spec store bypass: Vulnerable\nVulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization\nVulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling\nVulnerability Srbds:             Not affected\nVulnerability Tsx async abort:   Not affected\nFlags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni\n",
+        "versions": {
+            "transformers": "4.20.1",
+            "optimum": "1.2.3.dev0",
+            "optimum_hash": "5ac9c0d9fd7e7cca55b2f9935b961ed5b6c50112"
+        },
+        "evaluation": {
+            "time": [
+                {
+                    "batch_size": 8,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1305.4477120833333,
+                        "latency_std": 35.538171285940294,
+                        "latency_50": 1292.307282,
+                        "latency_90": 1365.7192274000001,
+                        "latency_95": 1365.90585495,
+                        "latency_99": 1366.05358539,
+                        "latency_999": 1366.0868247390001
+                    },
+                    "optimized": {
+                        "nb_forwards": 35,
+                        "throughput": 2.33,
+                        "latency_mean": 438.0590386857143,
+                        "latency_std": 9.397001314248586,
+                        "latency_50": 437.299136,
+                        "latency_90": 450.9606182,
+                        "latency_95": 453.12420089999995,
+                        "latency_99": 464.37316491999997,
+                        "latency_999": 468.224622292
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 75,
+                        "throughput": 5.0,
+                        "latency_mean": 201.245758,
+                        "latency_std": 1.6999127687727664,
+                        "latency_50": 201.408908,
+                        "latency_90": 203.4231772,
+                        "latency_95": 203.8507052,
+                        "latency_99": 205.37529206,
+                        "latency_999": 205.54267850600002
+                    },
+                    "optimized": {
+                        "nb_forwards": 214,
+                        "throughput": 14.27,
+                        "latency_mean": 70.30087592990655,
+                        "latency_std": 0.35279272958603625,
+                        "latency_50": 70.2417255,
+                        "latency_90": 70.7161781,
+                        "latency_95": 70.98808025,
+                        "latency_99": 71.56634268,
+                        "latency_999": 71.98565450699999
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 16,
+                        "throughput": 1.07,
+                        "latency_mean": 968.8260539375,
+                        "latency_std": 15.812966073710593,
+                        "latency_50": 971.2266795,
+                        "latency_90": 989.217858,
+                        "latency_95": 991.6490175,
+                        "latency_99": 993.7897995,
+                        "latency_999": 994.27147545
+                    },
+                    "optimized": {
+                        "nb_forwards": 34,
+                        "throughput": 2.27,
+                        "latency_mean": 443.7999996176471,
+                        "latency_std": 14.010678308843712,
+                        "latency_50": 439.840854,
+                        "latency_90": 470.9668241,
+                        "latency_95": 472.79326175,
+                        "latency_99": 481.8465039,
+                        "latency_999": 485.13520569
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 655.8404614782609,
+                        "latency_std": 1.9853736674833404,
+                        "latency_50": 655.276323,
+                        "latency_90": 657.7868594,
+                        "latency_95": 659.0648136,
+                        "latency_99": 662.15786138,
+                        "latency_999": 662.908107338
+                    },
+                    "optimized": {
+                        "nb_forwards": 62,
+                        "throughput": 4.13,
+                        "latency_mean": 243.33499380645162,
+                        "latency_std": 19.880611383661382,
+                        "latency_50": 258.4691415,
+                        "latency_90": 260.5706684,
+                        "latency_95": 260.7413557,
+                        "latency_99": 262.49824262,
+                        "latency_999": 263.22405246200003
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 14,
+                        "throughput": 0.93,
+                        "latency_mean": 1120.115416357143,
+                        "latency_std": 162.01666423442674,
+                        "latency_50": 1093.1451635,
+                        "latency_90": 1307.5234237,
+                        "latency_95": 1317.8936085999999,
+                        "latency_99": 1320.5370161199999,
+                        "latency_999": 1321.131782812
+                    },
+                    "optimized": {
+                        "nb_forwards": 31,
+                        "throughput": 2.07,
+                        "latency_mean": 497.1726317419355,
+                        "latency_std": 45.71850065729019,
+                        "latency_50": 528.362607,
+                        "latency_90": 532.55914,
+                        "latency_95": 533.1437515,
+                        "latency_99": 536.8183057,
+                        "latency_999": 538.16474467
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 654.7982457826088,
+                        "latency_std": 0.9735164168304667,
+                        "latency_50": 654.840949,
+                        "latency_90": 656.1865757999999,
+                        "latency_95": 656.3044589,
+                        "latency_99": 656.7216594600001,
+                        "latency_999": 656.827115646
+                    },
+                    "optimized": {
+                        "nb_forwards": 69,
+                        "throughput": 4.6,
+                        "latency_mean": 219.3826653768116,
+                        "latency_std": 4.794156646156331,
+                        "latency_50": 217.991558,
+                        "latency_90": 225.22350740000002,
+                        "latency_95": 230.0291534,
+                        "latency_99": 233.93463011999998,
+                        "latency_999": 236.29746481200002
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 654.579026173913,
+                        "latency_std": 0.9392631004917424,
+                        "latency_50": 654.222943,
+                        "latency_90": 655.8028544,
+                        "latency_95": 656.1702655,
+                        "latency_99": 657.1910353,
+                        "latency_999": 657.44114593
+                    },
+                    "optimized": {
+                        "nb_forwards": 59,
+                        "throughput": 3.93,
+                        "latency_mean": 258.5395601016949,
+                        "latency_std": 0.9767678625811486,
+                        "latency_50": 258.393859,
+                        "latency_90": 259.7185146,
+                        "latency_95": 260.0651381,
+                        "latency_99": 261.55840048,
+                        "latency_999": 261.836545048
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 76,
+                        "throughput": 5.07,
+                        "latency_mean": 199.48443313157895,
+                        "latency_std": 0.597793597350561,
+                        "latency_50": 199.475957,
+                        "latency_90": 200.222022,
+                        "latency_95": 200.34930725,
+                        "latency_99": 200.67836,
+                        "latency_999": 201.2770931
+                    },
+                    "optimized": {
+                        "nb_forwards": 212,
+                        "throughput": 14.13,
+                        "latency_mean": 70.97961195754718,
+                        "latency_std": 0.7368611290841657,
+                        "latency_50": 70.863433,
+                        "latency_90": 71.6552091,
+                        "latency_95": 71.9515168,
+                        "latency_99": 73.01431738,
+                        "latency_999": 73.983875939
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 91,
+                        "throughput": 6.07,
+                        "latency_mean": 165.84583646153845,
+                        "latency_std": 19.342885270197435,
+                        "latency_50": 156.269802,
+                        "latency_90": 201.686354,
+                        "latency_95": 202.9267375,
+                        "latency_99": 204.1038235,
+                        "latency_999": 204.40679995
+                    },
+                    "optimized": {
+                        "nb_forwards": 213,
+                        "throughput": 14.2,
+                        "latency_mean": 70.59720956338029,
+                        "latency_std": 0.2543356304868392,
+                        "latency_50": 70.562235,
+                        "latency_90": 70.9770522,
+                        "latency_95": 71.05967240000001,
+                        "latency_99": 71.12613707999999,
+                        "latency_999": 71.243581772
+                    }
+                }
+            ],
+            "others": {
+                "baseline": {
+                    "accuracy": 0.98
+                },
+                "optimized": {
+                    "accuracy": 0.98
+                }
+            }
+        },
+        "max_eval_samples": null,
+        "time_benchmark_args": {
+            "duration": 15,
+            "warmup_runs": 5
+        },
+        "model_type": "vit"
+    },
+    {
+        "model_name_or_path": "nateraw/vit-base-beans",
+        "task": "image-classification",
+        "dataset": {
+            "path": "beans",
+            "eval_split": "validation",
+            "data_keys": {
+                "primary": "image",
+                "secondary": null
+            },
+            "ref_keys": [
+                "labels"
+            ],
+            "name": null,
+            "calibration_split": null
+        },
+        "quantization_approach": "dynamic",
+        "operators_to_quantize": [],
+        "node_exclusion": [],
+        "aware_training": false,
+        "per_channel": true,
+        "calibration": null,
+        "framework": "onnxruntime",
+        "framework_args": {
+            "opset": 11,
+            "optimization_level": 1
+        },
+        "hardware": "Architecture:                    x86_64\nCPU op-mode(s):                  32-bit, 64-bit\nByte Order:                      Little Endian\nAddress sizes:                   46 bits physical, 48 bits virtual\nCPU(s):                          8\nOn-line CPU(s) list:             0-7\nThread(s) per core:              2\nCore(s) per socket:              4\nSocket(s):                       1\nNUMA node(s):                    1\nVendor ID:                       GenuineIntel\nCPU family:                      6\nModel:                           85\nModel name:                      Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz\nStepping:                        7\nCPU MHz:                         3094.719\nBogoMIPS:                        4999.99\nHypervisor vendor:               KVM\nVirtualization type:             full\nL1d cache:                       128 KiB\nL1i cache:                       128 KiB\nL2 cache:                        4 MiB\nL3 cache:                        35.8 MiB\nNUMA node0 CPU(s):               0-7\nVulnerability Itlb multihit:     KVM: Vulnerable\nVulnerability L1tf:              Mitigation; PTE Inversion\nVulnerability Mds:               Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown\nVulnerability Meltdown:          Mitigation; PTI\nVulnerability Spec store bypass: Vulnerable\nVulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization\nVulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling\nVulnerability Srbds:             Not affected\nVulnerability Tsx async abort:   Not affected\nFlags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni\n",
+        "versions": {
+            "transformers": "4.20.1",
+            "optimum": "1.2.3.dev0",
+            "optimum_hash": "5ac9c0d9fd7e7cca55b2f9935b961ed5b6c50112"
+        },
+        "evaluation": {
+            "time": [
+                {
+                    "batch_size": 1,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 91,
+                        "throughput": 6.07,
+                        "latency_mean": 166.31874561538461,
+                        "latency_std": 19.72451378066001,
+                        "latency_50": 155.056089,
+                        "latency_90": 199.330499,
+                        "latency_95": 200.1285845,
+                        "latency_99": 201.3523765,
+                        "latency_999": 202.36380325
+                    },
+                    "optimized": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 162.8075684516129,
+                        "latency_std": 0.4899426115628021,
+                        "latency_50": 162.752657,
+                        "latency_90": 163.428205,
+                        "latency_95": 163.61467280000002,
+                        "latency_99": 164.02712072,
+                        "latency_999": 164.65947177200002
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 661.2918193478262,
+                        "latency_std": 1.2312122042374622,
+                        "latency_50": 660.888877,
+                        "latency_90": 662.8252232000001,
+                        "latency_95": 664.1392218999999,
+                        "latency_99": 664.48232834,
+                        "latency_999": 664.5344920340001
+                    },
+                    "optimized": {
+                        "nb_forwards": 32,
+                        "throughput": 2.13,
+                        "latency_mean": 471.08557159375,
+                        "latency_std": 6.881870741980042,
+                        "latency_50": 471.846767,
+                        "latency_90": 477.006957,
+                        "latency_95": 479.45303175,
+                        "latency_99": 483.97764233,
+                        "latency_999": 484.659785333
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 16,
+                        "throughput": 1.07,
+                        "latency_mean": 956.3890503125,
+                        "latency_std": 17.516545911122616,
+                        "latency_50": 955.3814065,
+                        "latency_90": 971.4774225,
+                        "latency_95": 986.509896,
+                        "latency_99": 1003.5322224,
+                        "latency_999": 1007.36224584
+                    },
+                    "optimized": {
+                        "nb_forwards": 16,
+                        "throughput": 1.07,
+                        "latency_mean": 945.406017875,
+                        "latency_std": 34.15676228112691,
+                        "latency_50": 935.284914,
+                        "latency_90": 995.0056065,
+                        "latency_95": 1010.02209725,
+                        "latency_99": 1033.38679385,
+                        "latency_999": 1038.6438505849999
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 30,
+                        "throughput": 2.0,
+                        "latency_mean": 505.8109903,
+                        "latency_std": 8.216224152297352,
+                        "latency_50": 503.793211,
+                        "latency_90": 511.9732179,
+                        "latency_95": 520.8387349,
+                        "latency_99": 533.81575209,
+                        "latency_999": 537.108679209
+                    },
+                    "optimized": {
+                        "nb_forwards": 29,
+                        "throughput": 1.93,
+                        "latency_mean": 521.2037808275862,
+                        "latency_std": 80.14289996554862,
+                        "latency_50": 476.865559,
+                        "latency_90": 659.4999777999999,
+                        "latency_95": 660.1967527999999,
+                        "latency_99": 662.42166376,
+                        "latency_999": 663.0680583760001
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1285.8075280833332,
+                        "latency_std": 14.878247016495994,
+                        "latency_50": 1276.9767825,
+                        "latency_90": 1305.3958274000001,
+                        "latency_95": 1313.57056855,
+                        "latency_99": 1320.24036971,
+                        "latency_999": 1321.741074971
+                    },
+                    "optimized": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1318.8150754166668,
+                        "latency_std": 3.555651675868726,
+                        "latency_50": 1318.741799,
+                        "latency_90": 1322.4943963,
+                        "latency_95": 1324.30691675,
+                        "latency_99": 1325.98598095,
+                        "latency_999": 1326.363770395
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 30,
+                        "throughput": 2.0,
+                        "latency_mean": 505.5449488333333,
+                        "latency_std": 2.9916510998013135,
+                        "latency_50": 505.719641,
+                        "latency_90": 509.8432626,
+                        "latency_95": 510.09288375,
+                        "latency_99": 510.2705112,
+                        "latency_999": 510.32699682
+                    },
+                    "optimized": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 659.9233325217391,
+                        "latency_std": 1.3320484773330812,
+                        "latency_50": 659.844342,
+                        "latency_90": 661.1736396,
+                        "latency_95": 662.8099841000001,
+                        "latency_99": 663.25936154,
+                        "latency_999": 663.329135354
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 97,
+                        "throughput": 6.47,
+                        "latency_mean": 155.1479355773196,
+                        "latency_std": 2.333512915391771,
+                        "latency_50": 155.187476,
+                        "latency_90": 157.83224819999998,
+                        "latency_95": 158.7266684,
+                        "latency_99": 159.5641865199999,
+                        "latency_999": 166.83555695199996
+                    },
+                    "optimized": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 162.52027249462367,
+                        "latency_std": 0.3889875917639462,
+                        "latency_50": 162.494248,
+                        "latency_90": 163.1346838,
+                        "latency_95": 163.3073516,
+                        "latency_99": 163.47998024,
+                        "latency_999": 163.503808424
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 75,
+                        "throughput": 5.0,
+                        "latency_mean": 200.38047430666668,
+                        "latency_std": 1.8423713060693128,
+                        "latency_50": 200.733902,
+                        "latency_90": 202.2674594,
+                        "latency_95": 203.2803858,
+                        "latency_99": 204.16154648,
+                        "latency_999": 204.63530424799998
+                    },
+                    "optimized": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 162.5525952580645,
+                        "latency_std": 0.5907606366490742,
+                        "latency_50": 162.381627,
+                        "latency_90": 163.299415,
+                        "latency_95": 163.5639292,
+                        "latency_99": 164.28743187999999,
+                        "latency_999": 165.355540288
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1290.3547091666667,
+                        "latency_std": 25.877029400989002,
+                        "latency_50": 1284.203467,
+                        "latency_90": 1326.1323579,
+                        "latency_95": 1341.7394276,
+                        "latency_99": 1353.55684552,
+                        "latency_999": 1356.215764552
+                    },
+                    "optimized": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1314.744368,
+                        "latency_std": 2.6878210594945364,
+                        "latency_50": 1314.675457,
+                        "latency_90": 1317.7293187999999,
+                        "latency_95": 1318.6035473,
+                        "latency_99": 1319.4279286600001,
+                        "latency_999": 1319.613414466
+                    }
+                }
+            ],
+            "others": {
+                "baseline": {
+                    "accuracy": 0.98
+                },
+                "optimized": {
+                    "accuracy": 0.98
+                }
+            }
+        },
+        "max_eval_samples": null,
+        "time_benchmark_args": {
+            "duration": 15,
+            "warmup_runs": 5
+        },
+        "model_type": "vit"
+    },
+    {
+        "model_name_or_path": "nateraw/vit-base-beans",
+        "task": "image-classification",
+        "dataset": {
+            "path": "beans",
+            "eval_split": "validation",
+            "data_keys": {
+                "primary": "image",
+                "secondary": null
+            },
+            "ref_keys": [
+                "labels"
+            ],
+            "name": null,
+            "calibration_split": null
+        },
+        "quantization_approach": "dynamic",
+        "operators_to_quantize": [
+            "Add"
+        ],
+        "node_exclusion": [],
+        "aware_training": false,
+        "per_channel": true,
+        "calibration": null,
+        "framework": "onnxruntime",
+        "framework_args": {
+            "opset": 11,
+            "optimization_level": 1
+        },
+        "hardware": "Architecture:                    x86_64\nCPU op-mode(s):                  32-bit, 64-bit\nByte Order:                      Little Endian\nAddress sizes:                   46 bits physical, 48 bits virtual\nCPU(s):                          8\nOn-line CPU(s) list:             0-7\nThread(s) per core:              2\nCore(s) per socket:              4\nSocket(s):                       1\nNUMA node(s):                    1\nVendor ID:                       GenuineIntel\nCPU family:                      6\nModel:                           85\nModel name:                      Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz\nStepping:                        7\nCPU MHz:                         3098.350\nBogoMIPS:                        4999.99\nHypervisor vendor:               KVM\nVirtualization type:             full\nL1d cache:                       128 KiB\nL1i cache:                       128 KiB\nL2 cache:                        4 MiB\nL3 cache:                        35.8 MiB\nNUMA node0 CPU(s):               0-7\nVulnerability Itlb multihit:     KVM: Vulnerable\nVulnerability L1tf:              Mitigation; PTE Inversion\nVulnerability Mds:               Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown\nVulnerability Meltdown:          Mitigation; PTI\nVulnerability Spec store bypass: Vulnerable\nVulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization\nVulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling\nVulnerability Srbds:             Not affected\nVulnerability Tsx async abort:   Not affected\nFlags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni\n",
+        "versions": {
+            "transformers": "4.20.1",
+            "optimum": "1.2.3.dev0",
+            "optimum_hash": "5ac9c0d9fd7e7cca55b2f9935b961ed5b6c50112"
+        },
+        "evaluation": {
+            "time": [
+                {
+                    "batch_size": 1,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 75,
+                        "throughput": 5.0,
+                        "latency_mean": 200.81783336,
+                        "latency_std": 0.6847088030325814,
+                        "latency_50": 200.719484,
+                        "latency_90": 201.5889308,
+                        "latency_95": 202.1003811,
+                        "latency_99": 202.8800941,
+                        "latency_999": 203.30975400999998
+                    },
+                    "optimized": {
+                        "nb_forwards": 92,
+                        "throughput": 6.13,
+                        "latency_mean": 163.16872673913045,
+                        "latency_std": 1.5526071244167148,
+                        "latency_50": 162.727792,
+                        "latency_90": 164.082629,
+                        "latency_95": 164.58716825,
+                        "latency_99": 171.05059297999998,
+                        "latency_999": 171.827268698
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1279.5420475833332,
+                        "latency_std": 24.090200248013595,
+                        "latency_50": 1270.306791,
+                        "latency_90": 1306.8130119,
+                        "latency_95": 1325.1522126,
+                        "latency_99": 1341.54179372,
+                        "latency_999": 1345.2294494720002
+                    },
+                    "optimized": {
+                        "nb_forwards": 17,
+                        "throughput": 1.13,
+                        "latency_mean": 931.9069634117648,
+                        "latency_std": 11.481934784441602,
+                        "latency_50": 929.600892,
+                        "latency_90": 952.1709192000001,
+                        "latency_95": 952.655295,
+                        "latency_99": 954.1036662,
+                        "latency_999": 954.4295497200001
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 30,
+                        "throughput": 2.0,
+                        "latency_mean": 512.4660677333334,
+                        "latency_std": 27.823912989062475,
+                        "latency_50": 506.9528315,
+                        "latency_90": 512.7785411,
+                        "latency_95": 515.4465188,
+                        "latency_99": 619.0371795800002,
+                        "latency_999": 656.605258058
+                    },
+                    "optimized": {
+                        "nb_forwards": 32,
+                        "throughput": 2.13,
+                        "latency_mean": 470.661379375,
+                        "latency_std": 6.879698154147933,
+                        "latency_50": 469.55909,
+                        "latency_90": 481.2147577,
+                        "latency_95": 482.3090949,
+                        "latency_99": 486.69427243,
+                        "latency_999": 488.18883274300003
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 80,
+                        "throughput": 5.33,
+                        "latency_mean": 189.92974869999998,
+                        "latency_std": 20.0085609938314,
+                        "latency_50": 200.9213655,
+                        "latency_90": 204.7879244,
+                        "latency_95": 206.2371924,
+                        "latency_99": 210.71894763999998,
+                        "latency_999": 211.282830364
+                    },
+                    "optimized": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 162.44556237634407,
+                        "latency_std": 0.3515497302340489,
+                        "latency_50": 162.410174,
+                        "latency_90": 162.9352528,
+                        "latency_95": 163.0813806,
+                        "latency_99": 163.21715432,
+                        "latency_999": 163.446762032
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 657.0060832608697,
+                        "latency_std": 1.6045997297419479,
+                        "latency_50": 656.987305,
+                        "latency_90": 658.750338,
+                        "latency_95": 659.6307931,
+                        "latency_99": 660.7225840599999,
+                        "latency_999": 660.978761806
+                    },
+                    "optimized": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 660.2285839130435,
+                        "latency_std": 1.684238379976331,
+                        "latency_50": 659.931222,
+                        "latency_90": 662.106911,
+                        "latency_95": 662.5187496,
+                        "latency_99": 664.16473886,
+                        "latency_999": 664.572299486
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 97,
+                        "throughput": 6.47,
+                        "latency_mean": 154.67651497938144,
+                        "latency_std": 2.265059008007973,
+                        "latency_50": 154.530164,
+                        "latency_90": 158.0063524,
+                        "latency_95": 158.761275,
+                        "latency_99": 159.63890179999999,
+                        "latency_999": 161.83127588
+                    },
+                    "optimized": {
+                        "nb_forwards": 111,
+                        "throughput": 7.4,
+                        "latency_mean": 136.42054264864865,
+                        "latency_std": 13.669195772141308,
+                        "latency_50": 129.657713,
+                        "latency_90": 162.972988,
+                        "latency_95": 163.5093255,
+                        "latency_99": 164.2452398,
+                        "latency_999": 164.73538388999998
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 15,
+                        "throughput": 1.0,
+                        "latency_mean": 1012.3499144666667,
+                        "latency_std": 101.51911082567241,
+                        "latency_50": 971.620166,
+                        "latency_90": 1118.203026,
+                        "latency_95": 1211.1398262999996,
+                        "latency_99": 1307.9172236599998,
+                        "latency_999": 1329.6921380660003
+                    },
+                    "optimized": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1317.46020725,
+                        "latency_std": 3.756500822804119,
+                        "latency_50": 1317.000587,
+                        "latency_90": 1322.3999772,
+                        "latency_95": 1323.5495317,
+                        "latency_99": 1324.5479823399999,
+                        "latency_999": 1324.772633734
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 655.0756931304347,
+                        "latency_std": 1.4599780685925838,
+                        "latency_50": 654.878779,
+                        "latency_90": 656.67802,
+                        "latency_95": 658.3078226,
+                        "latency_99": 658.6252101,
+                        "latency_999": 658.6617698099999
+                    },
+                    "optimized": {
+                        "nb_forwards": 31,
+                        "throughput": 2.07,
+                        "latency_mean": 499.40584877419354,
+                        "latency_std": 69.82345023327741,
+                        "latency_50": 469.970446,
+                        "latency_90": 659.315274,
+                        "latency_95": 661.5421805,
+                        "latency_99": 662.4330344,
+                        "latency_999": 662.63913944
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 13,
+                        "throughput": 0.87,
+                        "latency_mean": 1181.2107936923078,
+                        "latency_std": 146.80801899246256,
+                        "latency_50": 1260.661433,
+                        "latency_90": 1303.1271828,
+                        "latency_95": 1325.1133756,
+                        "latency_99": 1343.53688152,
+                        "latency_999": 1347.682170352
+                    },
+                    "optimized": {
+                        "nb_forwards": 17,
+                        "throughput": 1.13,
+                        "latency_mean": 933.8200267058824,
+                        "latency_std": 25.86898009147869,
+                        "latency_50": 927.867181,
+                        "latency_90": 954.8453836,
+                        "latency_95": 967.3439077999999,
+                        "latency_99": 1006.87954876,
+                        "latency_999": 1015.775067976
+                    }
+                }
+            ],
+            "others": {
+                "baseline": {
+                    "accuracy": 0.98
+                },
+                "optimized": {
+                    "accuracy": 0.98
+                }
+            }
+        },
+        "max_eval_samples": null,
+        "time_benchmark_args": {
+            "duration": 15,
+            "warmup_runs": 5
+        },
+        "model_type": "vit"
+    },
+    {
+        "model_name_or_path": "nateraw/vit-base-beans",
+        "task": "image-classification",
+        "dataset": {
+            "path": "beans",
+            "eval_split": "validation",
+            "data_keys": {
+                "primary": "image",
+                "secondary": null
+            },
+            "ref_keys": [
+                "labels"
+            ],
+            "name": null,
+            "calibration_split": null
+        },
+        "quantization_approach": "dynamic",
+        "operators_to_quantize": [
+            "Add",
+            "MatMul"
+        ],
+        "node_exclusion": [],
+        "aware_training": false,
+        "per_channel": true,
+        "calibration": null,
+        "framework": "onnxruntime",
+        "framework_args": {
+            "opset": 11,
+            "optimization_level": 1
+        },
+        "hardware": "Architecture:                    x86_64\nCPU op-mode(s):                  32-bit, 64-bit\nByte Order:                      Little Endian\nAddress sizes:                   46 bits physical, 48 bits virtual\nCPU(s):                          8\nOn-line CPU(s) list:             0-7\nThread(s) per core:              2\nCore(s) per socket:              4\nSocket(s):                       1\nNUMA node(s):                    1\nVendor ID:                       GenuineIntel\nCPU family:                      6\nModel:                           85\nModel name:                      Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz\nStepping:                        7\nCPU MHz:                         3101.736\nBogoMIPS:                        4999.99\nHypervisor vendor:               KVM\nVirtualization type:             full\nL1d cache:                       128 KiB\nL1i cache:                       128 KiB\nL2 cache:                        4 MiB\nL3 cache:                        35.8 MiB\nNUMA node0 CPU(s):               0-7\nVulnerability Itlb multihit:     KVM: Vulnerable\nVulnerability L1tf:              Mitigation; PTE Inversion\nVulnerability Mds:               Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown\nVulnerability Meltdown:          Mitigation; PTI\nVulnerability Spec store bypass: Vulnerable\nVulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization\nVulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling\nVulnerability Srbds:             Not affected\nVulnerability Tsx async abort:   Not affected\nFlags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni\n",
+        "versions": {
+            "transformers": "4.20.1",
+            "optimum": "1.2.3.dev0",
+            "optimum_hash": "5ac9c0d9fd7e7cca55b2f9935b961ed5b6c50112"
+        },
+        "evaluation": {
+            "time": [
+                {
+                    "batch_size": 1,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 93,
+                        "throughput": 6.2,
+                        "latency_mean": 161.41171223655914,
+                        "latency_std": 14.870354336577428,
+                        "latency_50": 155.939994,
+                        "latency_90": 197.53404419999998,
+                        "latency_95": 197.912589,
+                        "latency_99": 198.83022556,
+                        "latency_999": 199.676923456
+                    },
+                    "optimized": {
+                        "nb_forwards": 207,
+                        "throughput": 13.8,
+                        "latency_mean": 72.70885894202898,
+                        "latency_std": 0.27964445519343206,
+                        "latency_50": 72.716042,
+                        "latency_90": 73.0758788,
+                        "latency_95": 73.17664590000001,
+                        "latency_99": 73.3802671,
+                        "latency_999": 73.51221445200001
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 76,
+                        "throughput": 5.07,
+                        "latency_mean": 199.65074372368423,
+                        "latency_std": 1.5160773445469795,
+                        "latency_50": 199.944024,
+                        "latency_90": 201.29589,
+                        "latency_95": 201.80196475,
+                        "latency_99": 202.12260975,
+                        "latency_999": 202.166467875
+                    },
+                    "optimized": {
+                        "nb_forwards": 209,
+                        "throughput": 13.93,
+                        "latency_mean": 71.78032969856459,
+                        "latency_std": 0.3993047349686967,
+                        "latency_50": 71.737218,
+                        "latency_90": 72.21902340000001,
+                        "latency_95": 72.5808502,
+                        "latency_99": 73.12714656,
+                        "latency_999": 73.66801959200001
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 25,
+                        "throughput": 1.67,
+                        "latency_mean": 617.44332692,
+                        "latency_std": 67.01575583161043,
+                        "latency_50": 660.252227,
+                        "latency_90": 663.977544,
+                        "latency_95": 666.9361547999999,
+                        "latency_99": 668.6067565599999,
+                        "latency_999": 668.8766228559999
+                    },
+                    "optimized": {
+                        "nb_forwards": 65,
+                        "throughput": 4.33,
+                        "latency_mean": 234.05122506153845,
+                        "latency_std": 19.589519412086375,
+                        "latency_50": 223.845147,
+                        "latency_90": 265.7009672,
+                        "latency_95": 266.8488142,
+                        "latency_99": 268.70655439999996,
+                        "latency_999": 269.60366863999997
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 661.2726233478261,
+                        "latency_std": 1.608488914998379,
+                        "latency_50": 661.364855,
+                        "latency_90": 663.035304,
+                        "latency_95": 663.3135758,
+                        "latency_99": 665.07419858,
+                        "latency_999": 665.515344758
+                    },
+                    "optimized": {
+                        "nb_forwards": 68,
+                        "throughput": 4.53,
+                        "latency_mean": 221.16239911764706,
+                        "latency_std": 5.801385463356238,
+                        "latency_50": 220.699395,
+                        "latency_90": 224.3565545,
+                        "latency_95": 225.22985880000002,
+                        "latency_99": 238.96943696999995,
+                        "latency_999": 260.98804769700024
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1255.69552375,
+                        "latency_std": 102.9288237877332,
+                        "latency_50": 1286.0972765,
+                        "latency_90": 1316.4811832,
+                        "latency_95": 1321.2916282,
+                        "latency_99": 1324.5891536400002,
+                        "latency_999": 1325.331096864
+                    },
+                    "optimized": {
+                        "nb_forwards": 31,
+                        "throughput": 2.07,
+                        "latency_mean": 489.54871912903224,
+                        "latency_std": 47.715982761163175,
+                        "latency_50": 473.097218,
+                        "latency_90": 543.395976,
+                        "latency_95": 545.5166235,
+                        "latency_99": 551.4498183,
+                        "latency_999": 553.21678713
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 663.4977195217391,
+                        "latency_std": 1.2918393132646473,
+                        "latency_50": 663.486377,
+                        "latency_90": 665.3720026,
+                        "latency_95": 665.7250922000001,
+                        "latency_99": 666.08418616,
+                        "latency_999": 666.167132716
+                    },
+                    "optimized": {
+                        "nb_forwards": 68,
+                        "throughput": 4.53,
+                        "latency_mean": 222.37462554411766,
+                        "latency_std": 2.8550137685990054,
+                        "latency_50": 222.140647,
+                        "latency_90": 226.3398684,
+                        "latency_95": 227.585471,
+                        "latency_99": 228.76303193,
+                        "latency_999": 229.374908693
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1296.6752090833334,
+                        "latency_std": 18.972208965807603,
+                        "latency_50": 1289.740581,
+                        "latency_90": 1318.5068230999998,
+                        "latency_95": 1326.9780795,
+                        "latency_99": 1334.9525383,
+                        "latency_999": 1336.7467915299999
+                    },
+                    "optimized": {
+                        "nb_forwards": 34,
+                        "throughput": 2.27,
+                        "latency_mean": 450.402188617647,
+                        "latency_std": 15.121871215842573,
+                        "latency_50": 446.2523015,
+                        "latency_90": 476.4106683,
+                        "latency_95": 481.48467215,
+                        "latency_99": 487.75100149,
+                        "latency_999": 489.587782249
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1289.5001165833332,
+                        "latency_std": 17.973529354626326,
+                        "latency_50": 1282.8982245,
+                        "latency_90": 1316.4851546,
+                        "latency_95": 1319.94446375,
+                        "latency_99": 1321.75708555,
+                        "latency_999": 1322.164925455
+                    },
+                    "optimized": {
+                        "nb_forwards": 34,
+                        "throughput": 2.27,
+                        "latency_mean": 443.4559223235294,
+                        "latency_std": 5.080053264660657,
+                        "latency_50": 443.1060375,
+                        "latency_90": 450.62357180000004,
+                        "latency_95": 451.23264075,
+                        "latency_99": 455.59029821,
+                        "latency_999": 457.427172821
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 74,
+                        "throughput": 4.93,
+                        "latency_mean": 203.5212293918919,
+                        "latency_std": 2.0551978472652848,
+                        "latency_50": 203.3370845,
+                        "latency_90": 205.9722848,
+                        "latency_95": 206.7128496,
+                        "latency_99": 208.65631527,
+                        "latency_999": 210.207952827
+                    },
+                    "optimized": {
+                        "nb_forwards": 207,
+                        "throughput": 13.8,
+                        "latency_mean": 72.47834015458938,
+                        "latency_std": 0.31101237142899824,
+                        "latency_50": 72.463726,
+                        "latency_90": 72.8683692,
+                        "latency_95": 72.9479522,
+                        "latency_99": 73.2749897,
+                        "latency_999": 73.51287976
+                    }
+                }
+            ],
+            "others": {
+                "baseline": {
+                    "accuracy": 0.98
+                },
+                "optimized": {
+                    "accuracy": 0.98
+                }
+            }
+        },
+        "max_eval_samples": null,
+        "time_benchmark_args": {
+            "duration": 15,
+            "warmup_runs": 5
+        },
+        "model_type": "vit"
+    },
+    {
+        "model_name_or_path": "nateraw/vit-base-beans",
+        "task": "image-classification",
+        "dataset": {
+            "path": "beans",
+            "eval_split": "validation",
+            "data_keys": {
+                "primary": "image",
+                "secondary": null
+            },
+            "ref_keys": [
+                "labels"
+            ],
+            "name": null,
+            "calibration_split": null
+        },
+        "quantization_approach": "dynamic",
+        "operators_to_quantize": [
+            "Add"
+        ],
+        "node_exclusion": [],
+        "aware_training": false,
+        "per_channel": false,
+        "calibration": null,
+        "framework": "onnxruntime",
+        "framework_args": {
+            "opset": 11,
+            "optimization_level": 1
+        },
+        "hardware": "Architecture:                    x86_64\nCPU op-mode(s):                  32-bit, 64-bit\nByte Order:                      Little Endian\nAddress sizes:                   46 bits physical, 48 bits virtual\nCPU(s):                          8\nOn-line CPU(s) list:             0-7\nThread(s) per core:              2\nCore(s) per socket:              4\nSocket(s):                       1\nNUMA node(s):                    1\nVendor ID:                       GenuineIntel\nCPU family:                      6\nModel:                           85\nModel name:                      Intel(R) Xeon(R) Platinum 8259CL CPU @ 2.50GHz\nStepping:                        7\nCPU MHz:                         3101.587\nBogoMIPS:                        4999.99\nHypervisor vendor:               KVM\nVirtualization type:             full\nL1d cache:                       128 KiB\nL1i cache:                       128 KiB\nL2 cache:                        4 MiB\nL3 cache:                        35.8 MiB\nNUMA node0 CPU(s):               0-7\nVulnerability Itlb multihit:     KVM: Vulnerable\nVulnerability L1tf:              Mitigation; PTE Inversion\nVulnerability Mds:               Vulnerable: Clear CPU buffers attempted, no microcode; SMT Host state unknown\nVulnerability Meltdown:          Mitigation; PTI\nVulnerability Spec store bypass: Vulnerable\nVulnerability Spectre v1:        Mitigation; usercopy/swapgs barriers and __user pointer sanitization\nVulnerability Spectre v2:        Mitigation; Retpolines, STIBP disabled, RSB filling\nVulnerability Srbds:             Not affected\nVulnerability Tsx async abort:   Not affected\nFlags:                           fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ss ht syscall nx pdpe1gb rdtscp lm constant_tsc rep_good nopl xtopology nonstop_tsc cpuid aperfmperf tsc_known_freq pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand hypervisor lahf_lm abm 3dnowprefetch invpcid_single pti fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid mpx avx512f avx512dq rdseed adx smap clflushopt clwb avx512cd avx512bw avx512vl xsaveopt xsavec xgetbv1 xsaves ida arat pku ospke avx512_vnni\n",
+        "versions": {
+            "transformers": "4.20.1",
+            "optimum": "1.2.3.dev0",
+            "optimum_hash": "5ac9c0d9fd7e7cca55b2f9935b961ed5b6c50112"
+        },
+        "evaluation": {
+            "time": [
+                {
+                    "batch_size": 4,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 661.5120061304348,
+                        "latency_std": 2.106123083282921,
+                        "latency_50": 661.334389,
+                        "latency_90": 663.6329132000001,
+                        "latency_95": 664.633404,
+                        "latency_99": 666.27847948,
+                        "latency_999": 666.671275048
+                    },
+                    "optimized": {
+                        "nb_forwards": 32,
+                        "throughput": 2.13,
+                        "latency_mean": 478.80772409375,
+                        "latency_std": 9.072497172419535,
+                        "latency_50": 478.339513,
+                        "latency_90": 490.9494227,
+                        "latency_95": 491.7442059,
+                        "latency_99": 499.86555171000003,
+                        "latency_999": 502.846627371
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 23,
+                        "throughput": 1.53,
+                        "latency_mean": 662.8350984782609,
+                        "latency_std": 2.1823429520784754,
+                        "latency_50": 662.268507,
+                        "latency_90": 666.2232022000001,
+                        "latency_95": 666.476752,
+                        "latency_99": 666.62039362,
+                        "latency_999": 666.6544751619999
+                    },
+                    "optimized": {
+                        "nb_forwards": 29,
+                        "throughput": 1.93,
+                        "latency_mean": 529.2792873448276,
+                        "latency_std": 87.518562392806,
+                        "latency_50": 475.50113,
+                        "latency_90": 660.0331162,
+                        "latency_95": 661.2807788,
+                        "latency_99": 661.9307412,
+                        "latency_999": 662.12196762
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 75,
+                        "throughput": 5.0,
+                        "latency_mean": 200.44544238666666,
+                        "latency_std": 0.9950016888231685,
+                        "latency_50": 200.190004,
+                        "latency_90": 201.82102559999998,
+                        "latency_95": 202.5074946,
+                        "latency_99": 203.1678269,
+                        "latency_999": 203.50243529
+                    },
+                    "optimized": {
+                        "nb_forwards": 116,
+                        "throughput": 7.73,
+                        "latency_mean": 129.39618714655174,
+                        "latency_std": 2.067964919431859,
+                        "latency_50": 129.4016065,
+                        "latency_90": 131.648931,
+                        "latency_95": 132.2263295,
+                        "latency_99": 136.81370259999997,
+                        "latency_999": 138.54515077000002
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1294.6512800833332,
+                        "latency_std": 16.84406616166231,
+                        "latency_50": 1290.4992525,
+                        "latency_90": 1317.8022775999998,
+                        "latency_95": 1319.4061700999998,
+                        "latency_99": 1320.8464036199998,
+                        "latency_999": 1321.170456162
+                    },
+                    "optimized": {
+                        "nb_forwards": 17,
+                        "throughput": 1.13,
+                        "latency_mean": 930.9679744117648,
+                        "latency_std": 16.6232043529702,
+                        "latency_50": 932.665064,
+                        "latency_90": 951.8628876,
+                        "latency_95": 955.4653676,
+                        "latency_99": 962.11386552,
+                        "latency_999": 963.6097775520001
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 64,
+                    "baseline": {
+                        "nb_forwards": 16,
+                        "throughput": 1.07,
+                        "latency_mean": 968.208967375,
+                        "latency_std": 21.2604505171286,
+                        "latency_50": 965.257975,
+                        "latency_90": 992.8459695,
+                        "latency_95": 999.56778275,
+                        "latency_99": 1010.44061495,
+                        "latency_999": 1012.8870021949999
+                    },
+                    "optimized": {
+                        "nb_forwards": 16,
+                        "throughput": 1.07,
+                        "latency_mean": 949.805699,
+                        "latency_std": 35.16724690173305,
+                        "latency_50": 939.915342,
+                        "latency_90": 1000.7328375,
+                        "latency_95": 1014.5590895,
+                        "latency_99": 1031.9948771,
+                        "latency_999": 1035.9179293099999
+                    }
+                },
+                {
+                    "batch_size": 8,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 12,
+                        "throughput": 0.8,
+                        "latency_mean": 1301.3549284166668,
+                        "latency_std": 32.02669612806352,
+                        "latency_50": 1287.5151945,
+                        "latency_90": 1354.4028801,
+                        "latency_95": 1358.3970296,
+                        "latency_99": 1361.5798787200001,
+                        "latency_999": 1362.2960197720001
+                    },
+                    "optimized": {
+                        "nb_forwards": 16,
+                        "throughput": 1.07,
+                        "latency_mean": 938.1431063125,
+                        "latency_std": 10.912116938304163,
+                        "latency_50": 935.909206,
+                        "latency_90": 950.346758,
+                        "latency_95": 953.7962335,
+                        "latency_99": 959.9162083,
+                        "latency_999": 961.29320263
+                    }
+                },
+                {
+                    "batch_size": 4,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 24,
+                        "throughput": 1.6,
+                        "latency_mean": 625.5643148333334,
+                        "latency_std": 64.81901502308723,
+                        "latency_50": 664.440001,
+                        "latency_90": 666.4835666,
+                        "latency_95": 666.6322609,
+                        "latency_99": 667.7429191699999,
+                        "latency_999": 668.034834917
+                    },
+                    "optimized": {
+                        "nb_forwards": 29,
+                        "throughput": 1.93,
+                        "latency_mean": 529.020664137931,
+                        "latency_std": 84.62326588659242,
+                        "latency_50": 480.707711,
+                        "latency_90": 665.4676168,
+                        "latency_95": 665.8970696,
+                        "latency_99": 666.84264784,
+                        "latency_999": 667.137783184
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 32,
+                    "baseline": {
+                        "nb_forwards": 91,
+                        "throughput": 6.07,
+                        "latency_mean": 166.03000134065934,
+                        "latency_std": 19.132517193102455,
+                        "latency_50": 157.036032,
+                        "latency_90": 202.654865,
+                        "latency_95": 203.5852275,
+                        "latency_99": 204.973207,
+                        "latency_999": 205.58968180000002
+                    },
+                    "optimized": {
+                        "nb_forwards": 100,
+                        "throughput": 6.67,
+                        "latency_mean": 150.93316497,
+                        "latency_std": 15.730253522641698,
+                        "latency_50": 162.552646,
+                        "latency_90": 163.3473,
+                        "latency_95": 163.7605152,
+                        "latency_99": 165.11680815,
+                        "latency_999": 165.673785615
+                    }
+                },
+                {
+                    "batch_size": 1,
+                    "input_length": 128,
+                    "baseline": {
+                        "nb_forwards": 76,
+                        "throughput": 5.07,
+                        "latency_mean": 199.0767317368421,
+                        "latency_std": 0.6088183471893983,
+                        "latency_50": 198.975675,
+                        "latency_90": 199.7272365,
+                        "latency_95": 200.26232875,
+                        "latency_99": 200.9589925,
+                        "latency_999": 201.30218545
+                    },
+                    "optimized": {
+                        "nb_forwards": 109,
+                        "throughput": 7.27,
+                        "latency_mean": 137.9706248715596,
+                        "latency_std": 15.205285051787508,
+                        "latency_50": 129.352799,
+                        "latency_90": 162.6187408,
+                        "latency_95": 162.7682896,
+                        "latency_99": 162.88476124000002,
+                        "latency_999": 162.94030542800002
+                    }
+                }
+            ],
+            "others": {
+                "baseline": {
+                    "accuracy": 0.98
+                },
+                "optimized": {
+                    "accuracy": 0.98
+                }
+            }
+        },
+        "max_eval_samples": null,
+        "time_benchmark_args": {
+            "duration": 15,
+            "warmup_runs": 5
+        },
+        "model_type": "vit"
+    }
+]

tensorboard/1657613137.713749/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c60208dfb0f65261910cfe99be8285d77deaa8204ce20104f03a2684478a848a
+size 717

tensorboard/1657613137.7153692/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:125e35f10746ffd2aa47f7ea87f02826a7182b78a9f56a3199f2d5d6ca0c7a26
+size 716

tensorboard/1657613137.7166162/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:69d188a9fbf1f849b01f032e672cf5f6eb719e02edf75b321084719c85aa7f3e
+size 707

tensorboard/1657613137.7177505/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:8c88aad85e58fda751a35106a501f7dd86be9968188d39cc616c59296d42f90d
+size 706

tensorboard/1657613137.7188103/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.5 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d1a4cc9bd0baddc21971282ad821bca10fca4fe351cc3b7551344dc961147df5
+size 703

tensorboard/1657613137.7203975/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.6 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a89c88942d70064790a326a7ac45441d2b42e0533f71dae88b442f9f62a2b147
+size 702

tensorboard/events.out.tfevents.1657613137.ip-10-0-107-255.ec2.internal.1.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d5001adbabe10e29288d08640d258997a3ebe34f2509e74831d27039bd7a08d3
+size 40