LEO-UI7M v0.1

Lightweight DOM-based browser agent. 7.4M parameters, CPU-only, no screenshots required.
Reads page DOM structure, predicts the next UI action (click, type, navigate).

Quick Start

pip install -r requirements.txt
python examples/run_agent.py --task "find the price of item X"

Model

  • Architecture: DOM-aware action predictor
  • Parameters: 7,458,567
  • Weights: model.safetensors
  • GPU required: No
  • Input: raw DOM text + task instruction
  • Output: UI action (element + action type)

Validated Capabilities

  • Wikipedia command β†’ research note
  • Python docs command β†’ structured docs results
  • Hacker News jobs command β†’ job digest
  • Page explanation from URL
  • Refusal for unsafe or out-of-scope tasks

Limitations

  • Not a general browser automation replacement
  • Not screenshot-based (DOM text only)
  • Not for arbitrary web automation
  • Best on pages with clean, semantic DOM structure
  • Trained on limited data β€” generalizes to simple, well-structured pages

Files

model.safetensors     β€” model weights
config.json           β€” model config
leo_ui7m/             β€” Python package (runtime, adapters, agent)
examples/run_agent.py β€” usage example
requirements.txt      β€” dependencies
tests/                β€” test suite

License

MIT

Downloads last month
22
Safetensors
Model size
7.46M params
Tensor type
F32
Β·
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support