qualcomm
/

EfficientNet-B0

Image Classification

PyTorch

TF Lite

backbone

android

Model card Files Files and versions Community

qaihm-bot commited on Mar 18

Commit

89c9537

•

1 Parent(s): fa36f97

Upload README.md with huggingface_hub

Browse files

Files changed (1) hide show

README.md +9 -9

README.md CHANGED Viewed

@@ -36,8 +36,8 @@ More details on model performance across various devices, can be found
 | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
 | ---|---|---|---|---|---|---|---|
-| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 2.184 ms | 0 - 2 MB | FP16 | NPU |  [EfficientNet-B0.tflite](https://huggingface.co/qualcomm/EfficientNet-B0/blob/main/EfficientNet-B0.tflite)
-| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 2.166 ms | 0 - 83 MB | FP16 | NPU |  [EfficientNet-B0.so](https://huggingface.co/qualcomm/EfficientNet-B0/blob/main/EfficientNet-B0.so)
 ## Installation
@@ -97,16 +97,16 @@ python -m qai_hub_models.models.efficientnet_b0.export
 ```
 Profile Job summary of EfficientNet-B0
 --------------------------------------------------
-Device: Samsung Galaxy S23 Ultra (13)
-Estimated Inference Time: 2.18 ms
-Estimated Peak Memory Range: 0.01-2.23 MB
 Compute Units: NPU (243) | Total (243)
 Profile Job summary of EfficientNet-B0
 --------------------------------------------------
-Device: Samsung Galaxy S23 Ultra (13)
-Estimated Inference Time: 2.17 ms
-Estimated Peak Memory Range: 0.01-82.84 MB
 Compute Units: NPU (242) | Total (242)
@@ -226,7 +226,7 @@ Explore all available models on [Qualcomm® AI Hub](https://aihub.qualcomm.com/)
 ## License
 - The license for the original implementation of EfficientNet-B0 can be found
   [here](https://github.com/pytorch/vision/blob/main/LICENSE).
-- The license for the compiled assets for on-device deployment can be found [here](https://qaihub-public-assets.s3.us-west-2.amazonaws.com/qai-hub-models/Qualcomm+AI+Hub+Proprietary+License.pdf).
 ## References
 * [EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks](https://arxiv.org/abs/1905.11946)

 | Device | Chipset | Target Runtime | Inference Time (ms) | Peak Memory Range (MB) | Precision | Primary Compute Unit | Target Model
 | ---|---|---|---|---|---|---|---|
+| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | TFLite | 2.174 ms | 0 - 2 MB | FP16 | NPU |  [EfficientNet-B0.tflite](https://huggingface.co/qualcomm/EfficientNet-B0/blob/main/EfficientNet-B0.tflite)
+| Samsung Galaxy S23 Ultra (Android 13) | Snapdragon® 8 Gen 2 | QNN Model Library | 2.173 ms | 0 - 83 MB | FP16 | NPU |  [EfficientNet-B0.so](https://huggingface.co/qualcomm/EfficientNet-B0/blob/main/EfficientNet-B0.so)
 ## Installation
 ```
 Profile Job summary of EfficientNet-B0
 --------------------------------------------------
+Device: Samsung Galaxy S24 (14)
+Estimated Inference Time: 1.52 ms
+Estimated Peak Memory Range: 0.01-67.59 MB
 Compute Units: NPU (243) | Total (243)
 Profile Job summary of EfficientNet-B0
 --------------------------------------------------
+Device: Samsung Galaxy S24 (14)
+Estimated Inference Time: 1.51 ms
+Estimated Peak Memory Range: 0.59-75.56 MB
 Compute Units: NPU (242) | Total (242)
 ## License
 - The license for the original implementation of EfficientNet-B0 can be found
   [here](https://github.com/pytorch/vision/blob/main/LICENSE).
+- The license for the compiled assets for on-device deployment can be found [here]({deploy_license_url})
 ## References
 * [EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks](https://arxiv.org/abs/1905.11946)