metadata

license: unknown

适用于RKNN2的Segment Anything模型

Segment Anything Model for RKNN2 (English readme see below)

安装RKNPU2 2.0.0b23版本运行库
在开发板上安装python-opencv, rknn-toolkit-lite2, onnxruntime等
从 https://huggingface.co/happyme531/segment-anything-rknn2 下载模型文件(sam_vit_b_01ec64.pth.encoder.patched.onnx.rknn,sam_vit_b_01ec64.pth.decoder.onnx)
执行run_sam_rknn.py即可

输入:

提示：

{"type": "point", "data": [540, 512], "label": 1}

输出:

性能: RK3588，单NPU核心，耗时约22000ms
..性能瓶颈: Softmax太大，NPU无法执行

编辑convert_encoder.py, 修改模型路径:

ONNX_MODEL="sam_vit_b_01ec64.pth.encoder.onnx"

Install RKNPU2 2.0.0b23 version runtime library
Install python-opencv, rknn-toolkit-lite2, onnxruntime, etc. on the development board
Download model files from https://huggingface.co/happyme531/segment-anything-rknn2 (sam_vit_b_01ec64.pth.encoder.patched.onnx.rknn, sam_vit_b_01ec64.pth.decoder.onnx)
Execute run_sam_rknn.py

Input:

Prompt:

{"type": "point", "data": [540, 512], "label": 1}

Output:

Performance: RK3588, single NPU core, takes about 22000ms
..Performance bottleneck: Softmax is too large, NPU cannot execute

Edit convert_encoder.py, modify the model path:

ONNX_MODEL="sam_vit_b_01ec64.pth.encoder.onnx"

Execute convert_encoder.py
Now it will output an rknn file, but its execution speed is very slow (~120s) because the model structure needs adjustment
Execute patch_graph.py, which will generate an adjusted onnx file
Edit convert_encoder.py again, modify the model path, and execute the conversion
The decoder model runs quickly, so there's no need for conversion. It can be run directly using onnxruntime CPU.