OmniParser icon_detect — MLX (YOLO11)

MLX weights for the icon_detect model from microsoft/OmniParser-v2.0: a YOLO11m fine-tuned on 67K screenshots to detect interactive UI elements. Inference runs on Apple Silicon and needs neither PyTorch nor ultralytics.

⚠️ License: AGPL-3.0

This model derives from Ultralytics YOLO11, which is AGPL-3.0 (strong copyleft). This repo is therefore licensed AGPL-3.0 (see LICENSE). If you run a modified version as a network service, AGPL §13 requires you to offer its users the corresponding source. For a commercial/proprietary product, consider an Ultralytics Enterprise License instead.

Provenance (corresponding source)

  • Original detector: microsoft/OmniParser-v2.0 icon_detect/model.pt (AGPL-3.0, Ultralytics YOLO11m).
  • MLX conversion format + inference code: yolo11-mlx.
  • omniparser_mlx.json: nc=1 (a single class), named icon.

Usage

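# Install yolo11_mlx from github.com/walter-grace/yolo11-mlx
from yolo11_mlx import YOLO11

# Load the converted MLX weights
model = YOLO11("omniparser_mlx.npz")

# conf is the confidence threshold; iou is the NMS overlap threshold
results = model.predict("screenshot.png", conf=0.05, iou=0.5)
boxes = results[0].boxes.xyxy   # one [x1, y1, x2, y2] row per detection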
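
Post-processing sketch

The xyxy boxes returned above are usually turned into click targets, and the iou argument is easier to reason about with the overlap measure written out. The following is a minimal, pure-Python sketch, not part of yolo11-mlx: the helper names iou_xyxy and box_centers are hypothetical, the sample boxes are made up, and it assumes the boxes have already been converted to plain (x1, y1, x2, y2) tuples (for an MLX array, something like boxes.tolist() would likely do this).

```python
from typing import Iterable, List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2) corner format


def iou_xyxy(a: Box, b: Box) -> float:
    """Intersection-over-union of two corner-format boxes (hypothetical helper)."""
    # Corners of the overlapping rectangle, if any
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def box_centers(boxes: Iterable[Box]) -> List[Tuple[float, float]]:
    """Center of each box, sorted top-to-bottom, then left-to-right."""
    centers = [((x1 + x2) / 2, (y1 + y2) / 2) for x1, y1, x2, y2 in boxes]
    return sorted(centers, key=lambda c: (c[1], c[0]))


# Made-up sample detections, standing in for model output
sample = [(100.0, 40.0, 140.0, 80.0), (10.0, 10.0, 50.0, 30.0)]
centers = box_centers(sample)
```

With iou=0.5, two detections whose iou_xyxy exceeds 0.5 would be treated as duplicates during non-maximum suppression, and only the higher-scoring one is kept. The sorted centers give a rough reading order for stepping through detected UI elements.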