qaihm-bot commited on
Commit
b81b6b3
·
verified ·
1 Parent(s): 6512559

See https://github.com/quic/ai-hub-models/releases/v0.34.0 for changelog.

README.md CHANGED
@@ -24,6 +24,7 @@ More details on model performance across various devices, can be found
24
  [here](https://aihub.qualcomm.com/models/openai_clip).
25
 
26
 
 
27
  ### Model Details
28
 
29
  - **Model Type:** Model_use_case.image_classification
@@ -36,31 +37,31 @@ More details on model performance across various devices, can be found
36
 
37
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
38
  |---|---|---|---|---|---|---|---|---|
39
- | OpenAI-Clip | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 71.105 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
40
  | OpenAI-Clip | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 58.433 ms | 1 - 481 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
41
- | OpenAI-Clip | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 26.259 ms | 0 - 360 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
42
  | OpenAI-Clip | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 22.93 ms | 1 - 452 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
43
- | OpenAI-Clip | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 23.733 ms | 0 - 117 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
44
  | OpenAI-Clip | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 19.924 ms | 0 - 53 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
45
- | OpenAI-Clip | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 28.475 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
46
  | OpenAI-Clip | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 22.704 ms | 0 - 484 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
47
- | OpenAI-Clip | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 71.105 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
48
  | OpenAI-Clip | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 58.433 ms | 1 - 481 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
49
- | OpenAI-Clip | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 23.741 ms | 0 - 51 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
50
  | OpenAI-Clip | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 19.803 ms | 0 - 55 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
51
- | OpenAI-Clip | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 29.538 ms | 0 - 349 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
52
  | OpenAI-Clip | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 23.717 ms | 1 - 479 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
53
- | OpenAI-Clip | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 23.789 ms | 0 - 33 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
54
  | OpenAI-Clip | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 19.847 ms | 0 - 60 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
55
- | OpenAI-Clip | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 28.475 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
56
  | OpenAI-Clip | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 22.704 ms | 0 - 484 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
57
- | OpenAI-Clip | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 23.566 ms | 0 - 29 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
58
  | OpenAI-Clip | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 19.911 ms | 0 - 57 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
59
  | OpenAI-Clip | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 25.037 ms | 1 - 39 MB | NPU | [OpenAI-Clip.onnx](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.onnx) |
60
- | OpenAI-Clip | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 16.784 ms | 0 - 415 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
61
  | OpenAI-Clip | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 14.146 ms | 1 - 491 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
62
  | OpenAI-Clip | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 17.712 ms | 0 - 531 MB | NPU | [OpenAI-Clip.onnx](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.onnx) |
63
- | OpenAI-Clip | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 13.737 ms | 0 - 406 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
64
  | OpenAI-Clip | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 13.268 ms | 1 - 470 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
65
  | OpenAI-Clip | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 17.115 ms | 1 - 510 MB | NPU | [OpenAI-Clip.onnx](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.onnx) |
66
  | OpenAI-Clip | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 21.759 ms | 1667 - 1667 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
@@ -122,17 +123,7 @@ device. This script does the following:
122
  ```bash
123
  python -m qai_hub_models.models.openai_clip.export
124
  ```
125
- ```
126
- Profiling Results
127
- ------------------------------------------------------------
128
- OpenAI-Clip
129
- Device : cs_8275 (ANDROID 14)
130
- Runtime : TFLITE
131
- Estimated inference time (ms) : 71.1
132
- Estimated peak memory usage (MB): [0, 407]
133
- Total # Ops : 1320
134
- Compute Unit(s) : npu (1318 ops) gpu (0 ops) cpu (2 ops)
135
- ```
136
 
137
 
138
  ## How does this work?
 
24
  [here](https://aihub.qualcomm.com/models/openai_clip).
25
 
26
 
27
+
28
  ### Model Details
29
 
30
  - **Model Type:** Model_use_case.image_classification
 
37
 
38
  | Model | Precision | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Primary Compute Unit | Target Model
39
  |---|---|---|---|---|---|---|---|---|
40
+ | OpenAI-Clip | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | TFLITE | 70.94 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
41
  | OpenAI-Clip | float | QCS8275 (Proxy) | Qualcomm® QCS8275 (Proxy) | QNN_DLC | 58.433 ms | 1 - 481 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
42
+ | OpenAI-Clip | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | TFLITE | 26.126 ms | 0 - 359 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
43
  | OpenAI-Clip | float | QCS8450 (Proxy) | Qualcomm® QCS8450 (Proxy) | QNN_DLC | 22.93 ms | 1 - 452 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
44
+ | OpenAI-Clip | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | TFLITE | 23.098 ms | 0 - 81 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
45
  | OpenAI-Clip | float | QCS8550 (Proxy) | Qualcomm® QCS8550 (Proxy) | QNN_DLC | 19.924 ms | 0 - 53 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
46
+ | OpenAI-Clip | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | TFLITE | 28.575 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
47
  | OpenAI-Clip | float | QCS9075 (Proxy) | Qualcomm® QCS9075 (Proxy) | QNN_DLC | 22.704 ms | 0 - 484 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
48
+ | OpenAI-Clip | float | SA7255P ADP | Qualcomm® SA7255P | TFLITE | 70.94 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
49
  | OpenAI-Clip | float | SA7255P ADP | Qualcomm® SA7255P | QNN_DLC | 58.433 ms | 1 - 481 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
50
+ | OpenAI-Clip | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | TFLITE | 23.587 ms | 0 - 72 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
51
  | OpenAI-Clip | float | SA8255 (Proxy) | Qualcomm® SA8255P (Proxy) | QNN_DLC | 19.803 ms | 0 - 55 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
52
+ | OpenAI-Clip | float | SA8295P ADP | Qualcomm® SA8295P | TFLITE | 29.464 ms | 0 - 349 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
53
  | OpenAI-Clip | float | SA8295P ADP | Qualcomm® SA8295P | QNN_DLC | 23.717 ms | 1 - 479 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
54
+ | OpenAI-Clip | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | TFLITE | 24.016 ms | 0 - 26 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
55
  | OpenAI-Clip | float | SA8650 (Proxy) | Qualcomm® SA8650P (Proxy) | QNN_DLC | 19.847 ms | 0 - 60 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
56
+ | OpenAI-Clip | float | SA8775P ADP | Qualcomm® SA8775P | TFLITE | 28.575 ms | 0 - 407 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
57
  | OpenAI-Clip | float | SA8775P ADP | Qualcomm® SA8775P | QNN_DLC | 22.704 ms | 0 - 484 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
58
+ | OpenAI-Clip | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | TFLITE | 23.76 ms | 0 - 26 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
59
  | OpenAI-Clip | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | QNN_DLC | 19.911 ms | 0 - 57 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
60
  | OpenAI-Clip | float | Samsung Galaxy S23 | Snapdragon® 8 Gen 2 Mobile | ONNX | 25.037 ms | 1 - 39 MB | NPU | [OpenAI-Clip.onnx](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.onnx) |
61
+ | OpenAI-Clip | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | TFLITE | 16.86 ms | 0 - 413 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
62
  | OpenAI-Clip | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | QNN_DLC | 14.146 ms | 1 - 491 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
63
  | OpenAI-Clip | float | Samsung Galaxy S24 | Snapdragon® 8 Gen 3 Mobile | ONNX | 17.712 ms | 0 - 531 MB | NPU | [OpenAI-Clip.onnx](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.onnx) |
64
+ | OpenAI-Clip | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | TFLITE | 15.996 ms | 0 - 406 MB | NPU | [OpenAI-Clip.tflite](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.tflite) |
65
  | OpenAI-Clip | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | QNN_DLC | 13.268 ms | 1 - 470 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
66
  | OpenAI-Clip | float | Snapdragon 8 Elite QRD | Snapdragon® 8 Elite Mobile | ONNX | 17.115 ms | 1 - 510 MB | NPU | [OpenAI-Clip.onnx](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.onnx) |
67
  | OpenAI-Clip | float | Snapdragon X Elite CRD | Snapdragon® X Elite | QNN_DLC | 21.759 ms | 1667 - 1667 MB | NPU | [OpenAI-Clip.dlc](https://huggingface.co/qualcomm/OpenAI-Clip/blob/main/OpenAI-Clip.dlc) |
 
123
  ```bash
124
  python -m qai_hub_models.models.openai_clip.export
125
  ```
126
+
 
 
 
 
 
 
 
 
 
 
127
 
128
 
129
  ## How does this work?
precompiled/qualcomm-snapdragon-x-elite/OpenAI-Clip.onnx.zip CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6e5531d623ca61cf7dd474bdfe01339068595ac73dfc4f909587e9f6c611d5f8
3
  size 278937907
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8da4583071cdc451eaf52d67f1bb5972b5634c5513671333122ecf0f5e8a20c8
3
  size 278937907
precompiled/qualcomm-snapdragon-x-elite/sdk_versions.yml ADDED
@@ -0,0 +1,5 @@
 
 
 
 
 
 
1
+ sdk_versions:
2
+ qnn_context_binary:
3
+ qairt: 2.34.2.250528164111_119506
4
+ precompiled_qnn_onnx:
5
+ qairt: 2.33.2.250410134701_117956