Felix Marty
commited on
Commit
•
a98cdd1
1
Parent(s):
2596ac4
fix image display
Browse files
README.md
CHANGED
@@ -19,12 +19,8 @@ from optimum.onnxruntime import ORTModelForSequenceClassification
|
|
19 |
model = ORTModelForSequenceClassification.from_pretrained("fxmarty/distilbert-base-uncased-finetuned-sst-2-english-int8-static-dedicated-qdq-everywhere")
|
20 |
```
|
21 |
|
22 |
-
|
23 |
-
<summary>Weights stored as int8, only DequantizeLinear nodes (model here: https://huggingface.co/fxmarty/distilbert-base-uncased-finetuned-sst-2-english-int8-static)</summary>
|
24 |
![DQ only](no_qdq.png)
|
25 |
-
</details>
|
26 |
|
27 |
-
|
28 |
-
<summary>Weights stored as fp32, only QuantizeLinear + DequantizeLinear nodes (this model)</summary>
|
29 |
![QDQ](qdq.png)
|
30 |
-
</details>
|
|
|
19 |
model = ORTModelForSequenceClassification.from_pretrained("fxmarty/distilbert-base-uncased-finetuned-sst-2-english-int8-static-dedicated-qdq-everywhere")
|
20 |
```
|
21 |
|
22 |
+
### Weights stored as int8, only DequantizeLinear nodes (model here: https://huggingface.co/fxmarty/distilbert-base-uncased-finetuned-sst-2-english-int8-static)
|
|
|
23 |
![DQ only](no_qdq.png)
|
|
|
24 |
|
25 |
+
### Weights stored as fp32, only QuantizeLinear + DequantizeLinear nodes (this model)
|
|
|
26 |
![QDQ](qdq.png)
|
|