bhushans commited on
Commit
a20eb8a
1 Parent(s): 57b2e1e

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +58 -53
README.md CHANGED
@@ -38,52 +38,56 @@ More details on model performance across various devices, can be found
38
 
39
  | Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
41
- | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 203.933 ms | 22 - 93 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
42
- | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 305.337 ms | 0 - 69 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.so) |
43
- | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 306.182 ms | 12 - 141 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
44
- | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 157.661 ms | 37 - 77 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
45
- | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 230.887 ms | 0 - 287 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.so) |
46
- | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 221.376 ms | 95 - 1077 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
47
- | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 169.904 ms | 37 - 57 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
48
- | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 198.914 ms | 0 - 307 MB | FP16 | NPU | Use Export Script |
49
- | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 194.646 ms | 75 - 709 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
50
- | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 204.676 ms | 0 - 79 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
51
- | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 220.923 ms | 0 - 11 MB | FP16 | NPU | Use Export Script |
52
- | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 204.043 ms | 16 - 91 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
53
- | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 197.47 ms | 0 - 12 MB | FP16 | NPU | Use Export Script |
54
- | WhisperEncoder | SA8775 (Proxy) | SA8775P Proxy | TFLITE | 196.451 ms | 38 - 98 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
55
- | WhisperEncoder | SA8775 (Proxy) | SA8775P Proxy | QNN | 222.117 ms | 0 - 11 MB | FP16 | NPU | Use Export Script |
56
- | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 207.86 ms | 22 - 101 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
57
- | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 229.82 ms | 1 - 2 MB | FP16 | NPU | Use Export Script |
58
- | WhisperEncoder | SA8295P ADP | SA8295P | TFLITE | 206.208 ms | 36 - 60 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
59
- | WhisperEncoder | SA8295P ADP | SA8295P | QNN | 224.06 ms | 1 - 6 MB | FP16 | NPU | Use Export Script |
60
- | WhisperEncoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 267.331 ms | 15 - 61 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
61
- | WhisperEncoder | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 326.29 ms | 0 - 296 MB | FP16 | NPU | Use Export Script |
62
- | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 179.11 ms | 0 - 0 MB | FP16 | NPU | Use Export Script |
63
- | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 308.125 ms | 133 - 133 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
64
- | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 14.3 ms | 5 - 8 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
65
- | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 4.046 ms | 9 - 198 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.so) |
66
- | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 34.336 ms | 0 - 116 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
67
- | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 11.516 ms | 5 - 99 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
68
- | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 3.045 ms | 20 - 61 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.so) |
69
- | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 13.486 ms | 40 - 439 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
70
- | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 9.886 ms | 4 - 53 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
71
- | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 2.743 ms | 20 - 55 MB | FP16 | NPU | Use Export Script |
72
- | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 12.014 ms | 30 - 279 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
73
- | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 14.182 ms | 5 - 8 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
74
- | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 4.129 ms | 20 - 21 MB | FP16 | NPU | Use Export Script |
75
- | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 14.303 ms | 6 - 8 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
76
- | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 4.153 ms | 20 - 25 MB | FP16 | NPU | Use Export Script |
77
- | WhisperDecoder | SA8775 (Proxy) | SA8775P Proxy | TFLITE | 14.602 ms | 5 - 7 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
78
- | WhisperDecoder | SA8775 (Proxy) | SA8775P Proxy | QNN | 4.152 ms | 20 - 22 MB | FP16 | NPU | Use Export Script |
79
- | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 14.508 ms | 5 - 8 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
80
- | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 4.092 ms | 22 - 23 MB | FP16 | NPU | Use Export Script |
81
- | WhisperDecoder | SA8295P ADP | SA8295P | TFLITE | 15.945 ms | 6 - 47 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
82
- | WhisperDecoder | SA8295P ADP | SA8295P | QNN | 5.501 ms | 18 - 23 MB | FP16 | NPU | Use Export Script |
83
- | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 16.21 ms | 6 - 93 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
84
- | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 4.862 ms | 20 - 58 MB | FP16 | NPU | Use Export Script |
85
- | WhisperDecoder | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 3.762 ms | 20 - 20 MB | FP16 | NPU | Use Export Script |
86
- | WhisperDecoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 14.274 ms | 107 - 107 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
 
 
 
 
87
 
88
 
89
 
@@ -148,8 +152,8 @@ Profiling Results
148
  WhisperEncoder
149
  Device : Samsung Galaxy S23 (13)
150
  Runtime : TFLITE
151
- Estimated inference time (ms) : 203.9
152
- Estimated peak memory usage (MB): [22, 93]
153
  Total # Ops : 419
154
  Compute Unit(s) : GPU (408 ops) CPU (11 ops)
155
 
@@ -157,8 +161,8 @@ Compute Unit(s) : GPU (408 ops) CPU (11 ops)
157
  WhisperDecoder
158
  Device : Samsung Galaxy S23 (13)
159
  Runtime : TFLITE
160
- Estimated inference time (ms) : 14.3
161
- Estimated peak memory usage (MB): [5, 8]
162
  Total # Ops : 983
163
  Compute Unit(s) : NPU (983 ops)
164
  ```
@@ -179,11 +183,12 @@ in memory using the `jit.trace` and then call the `submit_compile_job` API.
179
  import torch
180
 
181
  import qai_hub as hub
182
- from qai_hub_models.models.whisper_base_en import WhisperEncoder,WhisperDecoder
183
 
184
  # Load the model
185
- encoder_model = WhisperEncoder.from_pretrained()
186
- decoder_model = WhisperDecoder.from_pretrained()
 
187
 
188
  # Device
189
  device = hub.Device("Samsung Galaxy S23")
 
38
 
39
  | Model | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
41
+ | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 199.951 ms | 0 - 81 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
42
+ | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 269.428 ms | 0 - 81 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.so) |
43
+ | WhisperEncoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 272.557 ms | 61 - 119 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
44
+ | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 153.295 ms | 37 - 75 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
45
+ | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 184.254 ms | 0 - 290 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.so) |
46
+ | WhisperEncoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 206.768 ms | 40 - 929 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
47
+ | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 145.127 ms | 37 - 57 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
48
+ | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 173.991 ms | 0 - 304 MB | FP16 | NPU | Use Export Script |
49
+ | WhisperEncoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 181.021 ms | 95 - 638 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
50
+ | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 197.727 ms | 0 - 77 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
51
+ | WhisperEncoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 205.013 ms | 1 - 2 MB | FP16 | NPU | Use Export Script |
52
+ | WhisperEncoder | SA7255P ADP | SA7255P | TFLITE | 1157.378 ms | 37 - 63 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
53
+ | WhisperEncoder | SA7255P ADP | SA7255P | QNN | 935.22 ms | 1 - 8 MB | FP16 | NPU | Use Export Script |
54
+ | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 196.948 ms | 0 - 109 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
55
+ | WhisperEncoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 220.231 ms | 1 - 2 MB | FP16 | NPU | Use Export Script |
56
+ | WhisperEncoder | SA8295P ADP | SA8295P | TFLITE | 205.03 ms | 38 - 63 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
57
+ | WhisperEncoder | SA8295P ADP | SA8295P | QNN | 219.872 ms | 1 - 6 MB | FP16 | NPU | Use Export Script |
58
+ | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 198.057 ms | 0 - 79 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
59
+ | WhisperEncoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 202.554 ms | 0 - 20 MB | FP16 | NPU | Use Export Script |
60
+ | WhisperEncoder | SA8775P ADP | SA8775P | TFLITE | 366.822 ms | 38 - 63 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
61
+ | WhisperEncoder | SA8775P ADP | SA8775P | QNN | 194.703 ms | 1 - 6 MB | FP16 | NPU | Use Export Script |
62
+ | WhisperEncoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 324.843 ms | 12 - 56 MB | FP16 | GPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.tflite) |
63
+ | WhisperEncoder | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 272.669 ms | 0 - 299 MB | FP16 | NPU | Use Export Script |
64
+ | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 159.232 ms | 0 - 0 MB | FP16 | NPU | Use Export Script |
65
+ | WhisperEncoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 297.71 ms | 133 - 133 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperEncoder.onnx) |
66
+ | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | TFLITE | 38.364 ms | 3 - 41 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
67
+ | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | QNN | 4.236 ms | 20 - 65 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.so) |
68
+ | WhisperDecoder | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 | ONNX | 16.021 ms | 0 - 117 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
69
+ | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | TFLITE | 30.646 ms | 0 - 76 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
70
+ | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | QNN | 3.339 ms | 0 - 51 MB | FP16 | NPU | [Whisper-Base-En.so](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.so) |
71
+ | WhisperDecoder | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 | ONNX | 13.615 ms | 53 - 277 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
72
+ | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | TFLITE | 26.759 ms | 4 - 75 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
73
+ | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | QNN | 2.558 ms | 20 - 69 MB | FP16 | NPU | Use Export Script |
74
+ | WhisperDecoder | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite | ONNX | 12.032 ms | 28 - 229 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
75
+ | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | TFLITE | 38.53 ms | 0 - 39 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
76
+ | WhisperDecoder | QCS8550 (Proxy) | QCS8550 Proxy | QNN | 4.204 ms | 20 - 22 MB | FP16 | NPU | Use Export Script |
77
+ | WhisperDecoder | SA7255P ADP | SA7255P | TFLITE | 68.01 ms | 3 - 74 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
78
+ | WhisperDecoder | SA7255P ADP | SA7255P | QNN | 26.944 ms | 20 - 26 MB | FP16 | NPU | Use Export Script |
79
+ | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | TFLITE | 38.528 ms | 6 - 44 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
80
+ | WhisperDecoder | SA8255 (Proxy) | SA8255P Proxy | QNN | 4.237 ms | 20 - 22 MB | FP16 | NPU | Use Export Script |
81
+ | WhisperDecoder | SA8295P ADP | SA8295P | TFLITE | 41.065 ms | 6 - 70 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
82
+ | WhisperDecoder | SA8295P ADP | SA8295P | QNN | 5.798 ms | 18 - 23 MB | FP16 | NPU | Use Export Script |
83
+ | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | TFLITE | 38.144 ms | 4 - 40 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
84
+ | WhisperDecoder | SA8650 (Proxy) | SA8650P Proxy | QNN | 4.157 ms | 21 - 22 MB | FP16 | NPU | Use Export Script |
85
+ | WhisperDecoder | SA8775P ADP | SA8775P | TFLITE | 38.291 ms | 5 - 76 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
86
+ | WhisperDecoder | SA8775P ADP | SA8775P | QNN | 5.574 ms | 18 - 23 MB | FP16 | NPU | Use Export Script |
87
+ | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | TFLITE | 43.741 ms | 2 - 77 MB | FP16 | NPU | [Whisper-Base-En.tflite](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.tflite) |
88
+ | WhisperDecoder | QCS8450 (Proxy) | QCS8450 Proxy | QNN | 5.068 ms | 18 - 66 MB | FP16 | NPU | Use Export Script |
89
+ | WhisperDecoder | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN | 3.805 ms | 20 - 20 MB | FP16 | NPU | Use Export Script |
90
+ | WhisperDecoder | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 14.392 ms | 107 - 107 MB | FP16 | NPU | [Whisper-Base-En.onnx](https://huggingface.co/qualcomm/Whisper-Base-En/blob/main/WhisperDecoder.onnx) |
91
 
92
 
93
 
 
152
  WhisperEncoder
153
  Device : Samsung Galaxy S23 (13)
154
  Runtime : TFLITE
155
+ Estimated inference time (ms) : 200.0
156
+ Estimated peak memory usage (MB): [0, 81]
157
  Total # Ops : 419
158
  Compute Unit(s) : GPU (408 ops) CPU (11 ops)
159
 
 
161
  WhisperDecoder
162
  Device : Samsung Galaxy S23 (13)
163
  Runtime : TFLITE
164
+ Estimated inference time (ms) : 38.4
165
+ Estimated peak memory usage (MB): [3, 41]
166
  Total # Ops : 983
167
  Compute Unit(s) : NPU (983 ops)
168
  ```
 
183
  import torch
184
 
185
  import qai_hub as hub
186
+ from qai_hub_models.models.whisper_base_en import Model
187
 
188
  # Load the model
189
+ model = Model.from_pretrained()
190
+ encoder_model = model.encoder
191
+ decoder_model = model.decoder
192
 
193
  # Device
194
  device = hub.Device("Samsung Galaxy S23")