v0.32.0
Browse filesSee https://github.com/quic/ai-hub-models/releases/v0.32.0 for changelog.
.gitattributes
CHANGED
|
@@ -37,3 +37,4 @@ WhisperEncoder.so filter=lfs diff=lfs merge=lfs -text
|
|
| 37 |
WhisperDecoder.so filter=lfs diff=lfs merge=lfs -text
|
| 38 |
Whisper-Small-En_WhisperDecoderInf.dlc filter=lfs diff=lfs merge=lfs -text
|
| 39 |
Whisper-Small-En_WhisperEncoderInf.dlc filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 37 |
WhisperDecoder.so filter=lfs diff=lfs merge=lfs -text
|
| 38 |
Whisper-Small-En_WhisperDecoderInf.dlc filter=lfs diff=lfs merge=lfs -text
|
| 39 |
Whisper-Small-En_WhisperEncoderInf.dlc filter=lfs diff=lfs merge=lfs -text
|
| 40 |
+
DEPLOYMENT_MODEL_LICENSE.pdf filter=lfs diff=lfs merge=lfs -text
|
Whisper-Small-En_WhisperDecoderInf.onnx → DEPLOYMENT_MODEL_LICENSE.pdf
RENAMED
|
@@ -1,3 +1,3 @@
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:
|
| 3 |
-
size
|
|
|
|
| 1 |
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:4409f93b0e82531303b3e10f52f1fdfb56467a25f05b7441c6bbd8bb8a64b42c
|
| 3 |
+
size 109629
|
LICENSE
ADDED
|
@@ -0,0 +1,2 @@
|
|
|
|
|
|
|
|
|
|
| 1 |
+
The license of the original trained model can be found at https://github.com/openai/whisper/blob/main/LICENSE.
|
| 2 |
+
The license for the deployable model files (.tflite, .onnx, .dlc, .bin, etc.) can be found in DEPLOYMENT_MODEL_LICENSE.pdf.
|
README.md
CHANGED
|
@@ -31,10 +31,10 @@ More details on model performance across various devices, can be found
|
|
| 31 |
- Model checkpoint: small.en
|
| 32 |
- Input resolution: 80x3000 (30 seconds audio)
|
| 33 |
- Mean decoded sequence length: 112 tokens
|
| 34 |
-
- Number of parameters (
|
| 35 |
-
- Model size (
|
| 36 |
-
- Number of parameters (
|
| 37 |
-
- Model size (
|
| 38 |
|
| 39 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 40 |
|---|---|---|---|---|---|---|---|---|
|
|
@@ -50,7 +50,6 @@ More details on model performance across various devices, can be found
|
|
| 50 |
| WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 686.994 ms | 85 - 167 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 51 |
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 518.045 ms | 109 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 52 |
| WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 545.405 ms | 381 - 381 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
|
| 53 |
-
| WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 631.145 ms | 295 - 295 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
|
| 54 |
| WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 55 |
| WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 49.307 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 56 |
| WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 49.176 ms | 5 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
|
@@ -63,7 +62,6 @@ More details on model performance across various devices, can be found
|
|
| 63 |
| WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 50.949 ms | 16 - 49 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 64 |
| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.926 ms | 16 - 419 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 65 |
| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 34.427 ms | 1205 - 1205 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
|
| 66 |
-
| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 34.753 ms | 227 - 227 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
|
| 67 |
|
| 68 |
|
| 69 |
|
|
|
|
| 31 |
- Model checkpoint: small.en
|
| 32 |
- Input resolution: 80x3000 (30 seconds audio)
|
| 33 |
- Mean decoded sequence length: 112 tokens
|
| 34 |
+
- Number of parameters (WhisperEncoderInf): 102M
|
| 35 |
+
- Model size (WhisperEncoderInf) (float): 390 MB
|
| 36 |
+
- Number of parameters (WhisperDecoderInf): 139M
|
| 37 |
+
- Model size (WhisperDecoderInf) (float): 532 MB
|
| 38 |
|
| 39 |
| Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
|
| 40 |
|---|---|---|---|---|---|---|---|---|
|
|
|
|
| 50 |
| WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 686.994 ms | 85 - 167 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 51 |
| WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 518.045 ms | 109 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 52 |
| WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 545.405 ms | 381 - 381 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
|
|
|
|
| 53 |
| WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 54 |
| WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 49.307 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 55 |
| WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 49.176 ms | 5 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
|
|
|
| 62 |
| WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 50.949 ms | 16 - 49 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 63 |
| WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.926 ms | 16 - 419 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
|
| 64 |
| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 34.427 ms | 1205 - 1205 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
|
|
|
|
| 65 |
|
| 66 |
|
| 67 |
|
Whisper-Small-En_WhisperEncoderInf.onnx
DELETED
|
@@ -1,3 +0,0 @@
|
|
| 1 |
-
version https://git-lfs.github.com/spec/v1
|
| 2 |
-
oid sha256:9f819a684ebc3dc0d9d19ebe87a8c11a74a5685162d748cb286b360042dbb4b5
|
| 3 |
-
size 409481559
|
|
|
|
|
|
|
|
|
|
|
|