Upload README.md with huggingface_hub
Browse files
README.md
CHANGED
|
@@ -38,30 +38,32 @@ More details on model performance across various devices, can be found
|
|
| 38 |
|
| 39 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 40 |
|---|---|---|---|---|---|---|---|---|
|
| 41 |
-
| WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE |
|
| 42 |
-
| WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE |
|
| 43 |
-
| WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE |
|
| 44 |
-
| WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE |
|
| 45 |
-
| WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE |
|
| 46 |
-
| WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE |
|
| 47 |
-
| WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE |
|
| 48 |
-
| WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE |
|
| 49 |
-
| WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE |
|
| 50 |
-
| WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE |
|
| 51 |
-
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE |
|
| 52 |
-
| WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE |
|
| 53 |
-
|
|
| 54 |
-
| WhisperDecoderInf | float |
|
| 55 |
-
| WhisperDecoderInf | float |
|
| 56 |
-
| WhisperDecoderInf | float |
|
| 57 |
-
| WhisperDecoderInf | float |
|
| 58 |
-
| WhisperDecoderInf | float |
|
| 59 |
-
| WhisperDecoderInf | float |
|
| 60 |
-
| WhisperDecoderInf | float |
|
| 61 |
-
| WhisperDecoderInf | float |
|
| 62 |
-
| WhisperDecoderInf | float |
|
| 63 |
-
| WhisperDecoderInf | float | Samsung Galaxy
|
| 64 |
-
| WhisperDecoderInf | float |
|
|
|
|
|
|
|
| 65 |
|
| 66 |
|
| 67 |
|
|
@@ -125,8 +127,8 @@ Profiling Results
|
|
| 125 |
WhisperEncoderInf
|
| 126 |
Device : cs_8275 (ANDROID 14)
|
| 127 |
Runtime : TFLITE
|
| 128 |
-
Estimated inference time (ms) :
|
| 129 |
-
Estimated peak memory usage (MB): [
|
| 130 |
Total # Ops : 911
|
| 131 |
Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
|
| 132 |
|
|
@@ -134,7 +136,7 @@ Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
|
|
| 134 |
WhisperDecoderInf
|
| 135 |
Device : cs_8275 (ANDROID 14)
|
| 136 |
Runtime : TFLITE
|
| 137 |
-
Estimated inference time (ms) :
|
| 138 |
Estimated peak memory usage (MB): [16, 268]
|
| 139 |
Total # Ops : 2573
|
| 140 |
Compute Unit(s) : npu (2573 ops) gpu (0 ops) cpu (0 ops)
|
|
|
|
| 38 |
|
| 39 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 40 |
|---|---|---|---|---|---|---|---|---|
|
| 41 |
+
| WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3226.414 ms | 108 - 141 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 42 |
+
| WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 986.224 ms | 101 - 200 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 43 |
+
| WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 1103.354 ms | 74 - 155 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 44 |
+
| WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 1271.289 ms | 100 - 133 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 45 |
+
| WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 3226.414 ms | 108 - 141 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 46 |
+
| WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 779.384 ms | 24 - 121 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 47 |
+
| WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 656.617 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 48 |
+
| WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 699.122 ms | 110 - 183 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 49 |
+
| WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 1271.289 ms | 100 - 133 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 50 |
+
| WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 1245.355 ms | 0 - 164 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 51 |
+
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 525.323 ms | 110 - 200 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 52 |
+
| WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 542.231 ms | 111 - 141 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 53 |
+
| WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 609.485 ms | 295 - 295 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
|
| 54 |
+
| WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 65.318 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 55 |
+
| WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 62.718 ms | 16 - 404 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 56 |
+
| WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 55.629 ms | 16 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 57 |
+
| WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 52.514 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 58 |
+
| WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 65.318 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 59 |
+
| WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 54.617 ms | 16 - 44 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 60 |
+
| WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 56.316 ms | 16 - 248 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 61 |
+
| WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 55.232 ms | 14 - 40 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 62 |
+
| WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 52.514 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 63 |
+
| WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 55.786 ms | 16 - 45 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 64 |
+
| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 45.752 ms | 16 - 412 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 65 |
+
| WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 42.151 ms | 23 - 278 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 66 |
+
| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 48.51 ms | 226 - 226 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
|
| 67 |
|
| 68 |
|
| 69 |
|
|
|
|
| 127 |
WhisperEncoderInf
|
| 128 |
Device : cs_8275 (ANDROID 14)
|
| 129 |
Runtime : TFLITE
|
| 130 |
+
Estimated inference time (ms) : 3226.4
|
| 131 |
+
Estimated peak memory usage (MB): [108, 141]
|
| 132 |
Total # Ops : 911
|
| 133 |
Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
|
| 134 |
|
|
|
|
| 136 |
WhisperDecoderInf
|
| 137 |
Device : cs_8275 (ANDROID 14)
|
| 138 |
Runtime : TFLITE
|
| 139 |
+
Estimated inference time (ms) : 65.3
|
| 140 |
Estimated peak memory usage (MB): [16, 268]
|
| 141 |
Total # Ops : 2573
|
| 142 |
Compute Unit(s) : npu (2573 ops) gpu (0 ops) cpu (0 ops)
|