AI & ML interests

UI-MOPD

Recent Activity

EliSpctre  updated a dataset 37 minutes ago
UI-MOPD/Uni-GUI-OpenMobile
EliSpctre  published a dataset 37 minutes ago
UI-MOPD/Uni-GUI-OpenMobile
EliSpctre  updated a dataset about 1 hour ago
UI-MOPD/AndroidControl-Star
View all activity

Organization Card

UI-MOPD: Multi-platform On-Policy Distillation for Continual GUI Agent Learning

We build cross-platform GUI agents that can operate both desktop and mobile interfaces through a unified training framework.

Research

UI-MOPD introduces a two-stage training pipeline:

  • Stage 1: Supervised Fine-Tuning (SFT) on platform-specific teacher models
  • Stage 2: Reinforcement Learning distillation (DAPO) with multi-teacher on-policy guidance

Our student model (8B) learns from multiple 32B teacher models to achieve strong cross-platform GUI interaction capabilities.

Models

ModelSizeDescription
Qwen3-VL-32B-Thinking-Desktop-Teacher33BDesktop platform teacher
Qwen3-VL-32B-Thinking-Mobile-Teacher33BMobile platform teacher
Qwen3-VL-8B-Thinking-Desktop-SFT9BDesktop SFT checkpoint
Qwen3-VL-8B-Thinking-Mobile-SFT9BMobile SFT checkpoint
Qwen3-VL-8B-Thinking-UI-MOPD-Student9BFinal cross-platform student

Datasets

DatasetDescription
Uni-GUI-OpenCUAPost-processed desktop trajectories from OpenCUA (~832 episodes, ~14K steps)
Uni-GUI-Desktop-1Large-scale desktop GUI trajectories (~2.7K episodes, ~36K steps)

Links