qaihm-bot commited on
Commit
76cab13
1 Parent(s): a6324d8

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +10 -10
README.md CHANGED
@@ -38,9 +38,9 @@ More details on model performance across various devices, can be found
38
 
39
  | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
40
  | ---|---|---|---|---|---|---|---|
41
- | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 11.701 ms | 0 - 1 MB | INT8 | NPU | [TextEncoder_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/TextEncoder_Quantized.bin)
42
- | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 216.248 ms | 0 - 1 MB | INT8 | NPU | [VAEDecoder_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/VAEDecoder_Quantized.bin)
43
- | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 100.062 ms | 0 - 2 MB | INT8 | NPU | [UNet_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/UNet_Quantized.bin)
44
 
45
 
46
 
@@ -103,22 +103,22 @@ python -m qai_hub_models.models.stable_diffusion_v2_1_quantized.export
103
  Profile Job summary of TextEncoder_Quantized
104
  --------------------------------------------------
105
  Device: Snapdragon X Elite CRD (11)
106
- Estimated Inference Time: 11.80 ms
107
- Estimated Peak Memory Range: 0.02-0.02 MB
108
  Compute Units: NPU (1040) | Total (1040)
109
 
110
  Profile Job summary of VAEDecoder_Quantized
111
  --------------------------------------------------
112
  Device: Snapdragon X Elite CRD (11)
113
- Estimated Inference Time: 220.25 ms
114
- Estimated Peak Memory Range: 0.04-0.04 MB
115
  Compute Units: NPU (170) | Total (170)
116
 
117
  Profile Job summary of UNet_Quantized
118
  --------------------------------------------------
119
  Device: Snapdragon X Elite CRD (11)
120
- Estimated Inference Time: 102.50 ms
121
- Estimated Peak Memory Range: 0.20-0.20 MB
122
  Compute Units: NPU (6361) | Total (6361)
123
 
124
 
@@ -242,7 +242,7 @@ Explore all available models on [Qualcomm® AI Hub](https://aihub.qualcomm.com/)
242
  * [Source Model Implementation](https://github.com/CompVis/stable-diffusion/tree/main)
243
 
244
  ## Community
245
- * Join [our AI Hub Slack community](https://qualcomm-ai-hub.slack.com/join/shared_invite/zt-2d5zsmas3-Sj0Q9TzslueCjS31eXG2UA#/shared-invite/email) to collaborate, post questions and learn more about on-device AI.
246
  * For questions or feedback please [reach out to us](mailto:ai-hub-support@qti.qualcomm.com).
247
 
248
 
 
38
 
39
  | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
40
  | ---|---|---|---|---|---|---|---|
41
+ | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 11.633 ms | 0 - 1 MB | INT8 | NPU | [TextEncoder_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/TextEncoder_Quantized.bin)
42
+ | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 217.134 ms | 0 - 2 MB | INT8 | NPU | [VAEDecoder_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/VAEDecoder_Quantized.bin)
43
+ | Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Binary | 101.094 ms | 0 - 2 MB | INT8 | NPU | [UNet_Quantized.bin](https://huggingface.co/qualcomm/Stable-Diffusion-v2.1/blob/main/UNet_Quantized.bin)
44
 
45
 
46
 
 
103
  Profile Job summary of TextEncoder_Quantized
104
  --------------------------------------------------
105
  Device: Snapdragon X Elite CRD (11)
106
+ Estimated Inference Time: 11.77 ms
107
+ Estimated Peak Memory Range: 0.01-0.01 MB
108
  Compute Units: NPU (1040) | Total (1040)
109
 
110
  Profile Job summary of VAEDecoder_Quantized
111
  --------------------------------------------------
112
  Device: Snapdragon X Elite CRD (11)
113
+ Estimated Inference Time: 220.18 ms
114
+ Estimated Peak Memory Range: 0.05-0.05 MB
115
  Compute Units: NPU (170) | Total (170)
116
 
117
  Profile Job summary of UNet_Quantized
118
  --------------------------------------------------
119
  Device: Snapdragon X Elite CRD (11)
120
+ Estimated Inference Time: 102.49 ms
121
+ Estimated Peak Memory Range: 0.19-0.19 MB
122
  Compute Units: NPU (6361) | Total (6361)
123
 
124
 
 
242
  * [Source Model Implementation](https://github.com/CompVis/stable-diffusion/tree/main)
243
 
244
  ## Community
245
+ * Join [our AI Hub Slack community](https://aihub.qualcomm.com/community/slack) to collaborate, post questions and learn more about on-device AI.
246
  * For questions or feedback please [reach out to us](mailto:ai-hub-support@qti.qualcomm.com).
247
 
248