qaihm-bot commited on
Commit
9a4c5a8
·
verified ·
1 Parent(s): 962dadb

See https://github.com/quic/ai-hub-models/releases/v0.32.0 for changelog.

.gitattributes CHANGED
@@ -37,3 +37,4 @@ WhisperEncoder.so filter=lfs diff=lfs merge=lfs -text
37
  WhisperDecoder.so filter=lfs diff=lfs merge=lfs -text
38
  Whisper-Small-En_WhisperDecoderInf.dlc filter=lfs diff=lfs merge=lfs -text
39
  Whisper-Small-En_WhisperEncoderInf.dlc filter=lfs diff=lfs merge=lfs -text
 
 
37
  WhisperDecoder.so filter=lfs diff=lfs merge=lfs -text
38
  Whisper-Small-En_WhisperDecoderInf.dlc filter=lfs diff=lfs merge=lfs -text
39
  Whisper-Small-En_WhisperEncoderInf.dlc filter=lfs diff=lfs merge=lfs -text
40
+ DEPLOYMENT_MODEL_LICENSE.pdf filter=lfs diff=lfs merge=lfs -text
Whisper-Small-En_WhisperDecoderInf.onnx → DEPLOYMENT_MODEL_LICENSE.pdf RENAMED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b1f595d19ce40056d5531b9accf21826016a84ee02e03b3c03c7ba0da0831af9
3
- size 716944574
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4409f93b0e82531303b3e10f52f1fdfb56467a25f05b7441c6bbd8bb8a64b42c
3
+ size 109629
LICENSE ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ The license of the original trained model can be found at https://github.com/openai/whisper/blob/main/LICENSE.
2
+ The license for the deployable model files (.tflite, .onnx, .dlc, .bin, etc.) can be found in DEPLOYMENT_MODEL_LICENSE.pdf.
README.md CHANGED
@@ -31,10 +31,10 @@ More details on model performance across various devices, can be found
31
  - Model checkpoint: small.en
32
  - Input resolution: 80x3000 (30 seconds audio)
33
  - Mean decoded sequence length: 112 tokens
34
- - Number of parameters (WhisperEncoder): 102M
35
- - Model size (WhisperEncoder): 390 MB
36
- - Number of parameters (WhisperDecoder): 139M
37
- - Model size (WhisperDecoder): 531 MB
38
 
39
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
@@ -50,7 +50,6 @@ More details on model performance across various devices, can be found
50
  | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 686.994 ms | 85 - 167 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
51
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 518.045 ms | 109 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
52
  | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 545.405 ms | 381 - 381 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
53
- | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 631.145 ms | 295 - 295 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
54
  | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
55
  | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 49.307 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
56
  | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 49.176 ms | 5 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
@@ -63,7 +62,6 @@ More details on model performance across various devices, can be found
63
  | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 50.949 ms | 16 - 49 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
64
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.926 ms | 16 - 419 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
65
  | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 34.427 ms | 1205 - 1205 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
66
- | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | ONNX | 34.753 ms | 227 - 227 MB | NPU | [Whisper-Small-En.onnx](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.onnx) |
67
 
68
 
69
 
 
31
  - Model checkpoint: small.en
32
  - Input resolution: 80x3000 (30 seconds audio)
33
  - Mean decoded sequence length: 112 tokens
34
+ - Number of parameters (WhisperEncoderInf): 102M
35
+ - Model size (WhisperEncoderInf) (float): 390 MB
36
+ - Number of parameters (WhisperDecoderInf): 139M
37
+ - Model size (WhisperDecoderInf) (float): 532 MB
38
 
39
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
40
  |---|---|---|---|---|---|---|---|---|
 
50
  | WhisperEncoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 686.994 ms | 85 - 167 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
51
  | WhisperEncoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 518.045 ms | 109 - 201 MB | GPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
52
  | WhisperEncoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 545.405 ms | 381 - 381 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
 
53
  | WhisperDecoderInf | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.015 ms | 16 - 384 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
54
  | WhisperDecoderInf | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 49.307 ms | 16 - 397 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
55
  | WhisperDecoderInf | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 49.176 ms | 5 - 43 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
 
62
  | WhisperDecoderInf | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 50.949 ms | 16 - 49 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
63
  | WhisperDecoderInf | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 44.926 ms | 16 - 419 MB | NPU | [Whisper-Small-En.tflite](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.tflite) |
64
  | WhisperDecoderInf | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 34.427 ms | 1205 - 1205 MB | NPU | [Whisper-Small-En.dlc](https://huggingface.co/qualcomm/Whisper-Small-En/blob/main/Whisper-Small-En.dlc) |
 
65
 
66
 
67
 
Whisper-Small-En_WhisperEncoderInf.onnx DELETED
@@ -1,3 +0,0 @@
1
- version https://git-lfs.github.com/spec/v1
2
- oid sha256:9f819a684ebc3dc0d9d19ebe87a8c11a74a5685162d748cb286b360042dbb4b5
3
- size 409481559