qaihm-bot commited on
Commit
3d5911b
·
verified ·
1 Parent(s): cfe34a6

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +29 -27
README.md CHANGED
@@ -38,30 +38,32 @@ More details on model performance across various devices, can be found
38
 
39
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
41
- | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3217.613 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
42
- | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 930.722 ms | 108 - 209 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
43
- | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 927.254 ms | 91 - 120 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
44
- | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 1269.961 ms | 93 - 125 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
45
- | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 3217.613 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
46
- | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 689.249 ms | 83 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
47
- | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 592.665 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
48
- | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 683.522 ms | 110 - 148 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
49
- | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 1269.961 ms | 93 - 125 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
50
- | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 967.038 ms | 56 - 79 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
51
- | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 761.71 ms | 107 - 202 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
52
- | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 462.676 ms | 109 - 140 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
53
- | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 64.757 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
54
- | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 66.789 ms | 16 - 404 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
55
- | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 54.77 ms | 14 - 40 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
56
- | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 52.42 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
57
- | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 64.757 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
58
- | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 55.735 ms | 16 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
59
- | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 56.015 ms | 16 - 248 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
60
- | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 54.869 ms | 16 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
61
- | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 52.42 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
62
- | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 55.021 ms | 12 - 39 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
63
- | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 45.081 ms | 6 - 406 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
64
- | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 41.701 ms | 15 - 269 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 
 
65
 
66
 
67
 
@@ -125,8 +127,8 @@ Profiling Results
125
  WhisperEncoderInf
126
  Device : cs_8275 (ANDROID 14)
127
  Runtime : TFLITE
128
- Estimated inference time (ms) : 3217.6
129
- Estimated peak memory usage (MB): [109, 142]
130
  Total # Ops : 911
131
  Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
132
 
@@ -134,7 +136,7 @@ Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
134
  WhisperDecoderInf
135
  Device : cs_8275 (ANDROID 14)
136
  Runtime : TFLITE
137
- Estimated inference time (ms) : 64.8
138
  Estimated peak memory usage (MB): [16, 268]
139
  Total # Ops : 2573
140
  Compute Unit(s) : npu (2573 ops) gpu (0 ops) cpu (0 ops)
 
38
 
39
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
41
+ | WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3226.414 ms | 108 - 141 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
42
+ | WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 986.224 ms | 101 - 200 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
43
+ | WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 1103.354 ms | 74 - 155 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
44
+ | WhisperEncoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 1271.289 ms | 100 - 133 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
45
+ | WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 3226.414 ms | 108 - 141 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
46
+ | WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 779.384 ms | 24 - 121 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
47
+ | WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 656.617 ms | 109 - 142 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
48
+ | WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 699.122 ms | 110 - 183 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
49
+ | WhisperEncoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 1271.289 ms | 100 - 133 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
50
+ | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 1245.355 ms | 0 - 164 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
51
+ | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 525.323 ms | 110 - 200 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
52
+ | WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 542.231 ms | 111 - 141 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
53
+ | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 609.485 ms | 295 - 295 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
54
+ | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 65.318 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
55
+ | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 62.718 ms | 16 - 404 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
56
+ | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 55.629 ms | 16 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
57
+ | WhisperDecoderInf | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 52.514 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
58
+ | WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 65.318 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
59
+ | WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 54.617 ms | 16 - 44 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
60
+ | WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 56.316 ms | 16 - 248 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
61
+ | WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 55.232 ms | 14 - 40 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
62
+ | WhisperDecoderInf | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 52.514 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
63
+ | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 55.786 ms | 16 - 45 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
64
+ | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 45.752 ms | 16 - 412 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
65
+ | WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 42.151 ms | 23 - 278 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
66
+ | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 48.51 ms | 226 - 226 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
67
 
68
 
69
 
 
127
  WhisperEncoderInf
128
  Device : cs_8275 (ANDROID 14)
129
  Runtime : TFLITE
130
+ Estimated inference time (ms) : 3226.4
131
+ Estimated peak memory usage (MB): [108, 141]
132
  Total # Ops : 911
133
  Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
134
 
 
136
  WhisperDecoderInf
137
  Device : cs_8275 (ANDROID 14)
138
  Runtime : TFLITE
139
+ Estimated inference time (ms) : 65.3
140
  Estimated peak memory usage (MB): [16, 268]
141
  Total # Ops : 2573
142
  Compute Unit(s) : npu (2573 ops) gpu (0 ops) cpu (0 ops)