v0.32.0

Browse files

See https://github.com/quic/ai-hub-models/releases/v0.32.0 for changelog.

Files changed (5) hide show

.gitattributes +1 -0
Whisper-Small-En_WhisperDecoderInf.onnx → DEPLOYMENT_MODEL_LICENSE.pdf +2 -2
LICENSE +2 -0
README.md +4 -6
Whisper-Small-En_WhisperEncoderInf.onnx +0 -3

.gitattributes CHANGED Viewed

@@ -37,3 +37,4 @@ WhisperEncoder.so filter=lfs diff=lfs merge=lfs -text
 WhisperDecoder.so filter=lfs diff=lfs merge=lfs -text
 Whisper-Small-En_WhisperDecoderInf.dlc filter=lfs diff=lfs merge=lfs -text
 Whisper-Small-En_WhisperEncoderInf.dlc filter=lfs diff=lfs merge=lfs -text

 WhisperDecoder.so filter=lfs diff=lfs merge=lfs -text
 Whisper-Small-En_WhisperDecoderInf.dlc filter=lfs diff=lfs merge=lfs -text
 Whisper-Small-En_WhisperEncoderInf.dlc filter=lfs diff=lfs merge=lfs -text
+DEPLOYMENT_MODEL_LICENSE.pdf filter=lfs diff=lfs merge=lfs -text

Whisper-Small-En_WhisperDecoderInf.onnx → DEPLOYMENT_MODEL_LICENSE.pdf RENAMED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b1f595d19ce40056d5531b9accf21826016a84ee02e03b3c03c7ba0da0831af9
-size 716944574

 version https://git-lfs.github.com/spec/v1
+oid sha256:4409f93b0e82531303b3e10f52f1fdfb56467a25f05b7441c6bbd8bb8a64b42c
+size 109629

LICENSE ADDED Viewed

	@@ -0,0 +1,2 @@


1	+ The license of the original trained model can be found at https://github.com/openai/whisper/blob/main/LICENSE.
2	+ The license for the deployable model files (.tflite, .onnx, .dlc, .bin, etc.) can be found in DEPLOYMENT_MODEL_LICENSE.pdf.

README.md CHANGED Viewed

@@ -31,10 +31,10 @@ More details on model performance across various devices, can be found
   - Model checkpoint: small.en
   - Input resolution: 80x3000 (30 seconds audio)
   - Mean decoded sequence length: 112 tokens
-  - Number of parameters (WhisperEncoder): 102M
-  - Model size (WhisperEncoder): 390 MB
-  - Number of parameters (WhisperDecoder): 139M
-  - Model size (WhisperDecoder): 531 MB
 | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
@@ -50,7 +50,6 @@ More details on model performance across various devices, can be found
 | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 686.994 ms | 85 - 167 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 518.045 ms | 109 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 545.405 ms | 381 - 381 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
-| WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 631.145 ms | 295 - 295 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
 | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 49.307 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 49.176 ms | 5 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
@@ -63,7 +62,6 @@ More details on model performance across various devices, can be found
 | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 50.949 ms | 16 - 49 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.926 ms | 16 - 419 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 34.427 ms | 1205 - 1205 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
-| WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 34.753 ms | 227 - 227 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |

   - Model checkpoint: small.en
   - Input resolution: 80x3000 (30 seconds audio)
   - Mean decoded sequence length: 112 tokens
+  - Number of parameters (WhisperEncoderInf): 102M
+  - Model size (WhisperEncoderInf) (float): 390 MB
+  - Number of parameters (WhisperDecoderInf): 139M
+  - Model size (WhisperDecoderInf) (float): 532 MB
 | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
 |---|---|---|---|---|---|---|---|---|
 | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 686.994 ms | 85 - 167 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 518.045 ms | 109 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 545.405 ms | 381 - 381 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
 | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 49.307 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 49.176 ms | 5 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 50.949 ms | 16 - 49 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.926 ms | 16 - 419 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 34.427 ms | 1205 - 1205 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |

Whisper-Small-En_WhisperEncoderInf.onnx DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:9f819a684ebc3dc0d9d19ebe87a8c11a74a5685162d748cb286b360042dbb4b5
-size 409481559