v0.29.1
See https://github.com/quic/ai-hub-models/releases/v0.29.1 for changelog.
- README.md +25 -29
- WhisperDecoder.bin → Whisper-Small-En_WhisperDecoderInf.onnx +2 -2
- WhisperDecoder.onnx → Whisper-Small-En_WhisperDecoderInf.tflite +2 -2
- WhisperEncoderInf.onnx → Whisper-Small-En_WhisperEncoderInf.onnx +0 -0
- WhisperEncoder.tflite → Whisper-Small-En_WhisperEncoderInf.tflite +0 -0
- WhisperDecoder.so +0 -3
- WhisperDecoder.tflite +0 -3
- WhisperDecoderInf.tflite +0 -3
- WhisperEncoder.bin +0 -3
- WhisperEncoder.onnx +0 -3
- WhisperEncoder.so +0 -3
- WhisperEncoderInf.tflite +0 -3
README.md
CHANGED
@@ -38,32 +38,28 @@ More details on model performance across various devices, can be found
 
 | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
-| WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE |
-| WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE |
-| WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE |
-| WhisperEncoderInf | float |
-| WhisperEncoderInf | float |
-| WhisperEncoderInf | float |
-| WhisperEncoderInf | float |
-| WhisperEncoderInf | float |
-| WhisperEncoderInf | float |
-| WhisperEncoderInf | float |
-| WhisperEncoderInf | float |
-|
-|
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float |
-| WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 55.786 ms | 16 - 45 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
-| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 45.752 ms | 16 - 412 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
-| WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 42.151 ms | 23 - 278 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
-| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 48.51 ms | 226 - 226 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
+| WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3224.417 ms | 107 - 140 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 1638.428 ms | 109 - 208 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 688.683 ms | 110 - 143 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 3224.417 ms | 107 - 140 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 897.319 ms | 100 - 123 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 656.478 ms | 109 - 140 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 685.162 ms | 18 - 181 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 817.059 ms | 104 - 147 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 524.574 ms | 109 - 203 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 543.977 ms | 110 - 141 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 607.169 ms | 295 - 295 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
+| WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 64.601 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 61.873 ms | 16 - 405 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 54.727 ms | 12 - 40 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 64.601 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 55.594 ms | 16 - 42 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 56.165 ms | 16 - 248 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 55.696 ms | 16 - 42 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 55.186 ms | 16 - 44 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.815 ms | 14 - 417 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 40.741 ms | 15 - 270 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
+| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 46.9 ms | 227 - 227 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
 
 
 
@@ -127,8 +123,8 @@ Profiling Results
 WhisperEncoderInf
 Device : cs_8275 (ANDROID 14)
 Runtime : TFLITE
-Estimated inference time (ms) :
-Estimated peak memory usage (MB): [
+Estimated inference time (ms) : 3224.4
+Estimated peak memory usage (MB): [107, 140]
 Total # Ops : 911
 Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
 
@@ -136,7 +132,7 @@ Compute Unit(s) : npu (0 ops) gpu (900 ops) cpu (11 ops)
 WhisperDecoderInf
 Device : cs_8275 (ANDROID 14)
 Runtime : TFLITE
-Estimated inference time (ms) :
+Estimated inference time (ms) : 64.6
 Estimated peak memory usage (MB): [16, 268]
 Total # Ops : 2573
 Compute Unit(s) : npu (2573 ops) gpu (0 ops) cpu (0 ops)
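The performance rows in the README hunk above follow a fixed nine-column Markdown layout, so they can be read programmatically when comparing releases. The following is a minimal sketch, not part of the repository: the function name `parse_perf_row` and the dictionary keys are hypothetical, and the two sample rows are copied from the added lines of this diff.

```python
# Column keys are hypothetical names chosen for this sketch; the order follows
# the README table header (Model, Precision, Device, Chipset, Target Runtime,
# Inference Time (ms), Peak Memory Range (MB), Primary Compute Unit, Target Model).
COLUMNS = [
    "model", "precision", "device", "chipset", "runtime",
    "inference_time", "peak_memory", "compute_unit", "target_model",
]


def parse_perf_row(row: str) -> dict:
    """Split one Markdown table row (optionally diff-prefixed) into a dict."""
    # Drop a leading diff marker and spaces, then the outer pipes.
    cells = [c.strip() for c in row.lstrip("+- ").strip().strip("|").split("|")]
    if len(cells) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} cells, got {len(cells)}")
    rec = dict(zip(COLUMNS, cells))
    # "3224.417 ms" -> 3224.417
    rec["inference_ms"] = float(rec["inference_time"].split()[0])
    # "107 - 140 MB" -> (107, 140)
    low, high = rec["peak_memory"].replace("MB", "").split("-")
    rec["peak_mb"] = (int(low), int(high))
    return rec


ENCODER_ROW = "+| WhisperEncoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 3224.417 ms | 107 - 140 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |"
DECODER_ROW = "+| WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 64.601 ms | 16 - 268 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |"

enc = parse_perf_row(ENCODER_ROW)
dec = parse_perf_row(DECODER_ROW)
for rec in (enc, dec):
    print(rec["model"], rec["device"], rec["compute_unit"], rec["inference_ms"], rec["peak_mb"])
```

The truncated rows on the removed side of the hunk have fewer than nine cells, so this sketch raises `ValueError` for them rather than guessing the missing values.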
WhisperDecoder.bin → Whisper-Small-En_WhisperDecoderInf.onnx
RENAMED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:b1f595d19ce40056d5531b9accf21826016a84ee02e03b3c03c7ba0da0831af9
+size 716944574
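The model files in this commit are stored as Git LFS pointers: three `key value` lines giving the spec version, a SHA-256 object id, and the byte size. A downloaded artifact can be checked against those two values. The sketch below assumes the pointer text is available as a string; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the sample pointer is the new one shown in the hunk above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid line has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file has the size and SHA-256 digest the pointer names."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False  # cheap check first; avoids hashing a wrong-sized file
    h = hashlib.sha256()
    with p.open("rb") as f:
        # Hash in 1 MiB chunks so large model files are not loaded whole.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:b1f595d19ce40056d5531b9accf21826016a84ee02e03b3c03c7ba0da0831af9
size 716944574
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("Whisper-Small-En_WhisperDecoderInf.onnx", pointer)` on a local copy would then report whether the download is intact; the local path here is an assumption about where the file was saved.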
WhisperDecoder.onnx → Whisper-Small-En_WhisperDecoderInf.tflite
RENAMED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:2c2049a24dbe4c1fcdc354b12d15e1bfad705f70316febd095f8f67c81745fe8
+size 557615568
WhisperEncoderInf.onnx → Whisper-Small-En_WhisperEncoderInf.onnx
RENAMED
File without changes
WhisperEncoder.tflite → Whisper-Small-En_WhisperEncoderInf.tflite
RENAMED
File without changes
WhisperDecoder.so
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:55d7f7466acec5afdde4b61d906b41857512de8a63c31ed1b50ce7ae18205ea1
-size 361676272
WhisperDecoder.tflite
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:a612f76581dfd7caadf1a563bb55ef962fad57d8c16be08508d0958027aee2d7
-size 557617160
WhisperDecoderInf.tflite
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:89313eac23258eca02ba4afe875971b5742cda08f95f2c4c4d0b77daa5d500b8
-size 557616808
WhisperEncoder.bin
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:40c88b5954377b3682db6bb46453757b46d6ef12e11af2421c4ab6de703647ab
-size 248188072
WhisperEncoder.onnx
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:f001457af241e0b37311a585a59cdcbdb1646049f28c259a26fe021b682a0985
-size 409498271
WhisperEncoder.so
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:a2e56d11e4b3197aa5b13f4f2b4f343b06d762772ec284a3b60ce09a26b28e9a
-size 207519296
WhisperEncoderInf.tflite
DELETED
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:2d63084276fa4f2d937797dc9447ef67c8f18814b04848171dac47ad54eeab3a
-size 409468768