Dataset: Stevenshuqing/gui360-balanced
Cooperative LoRA (rank r=128, T=2 communication rounds), initialized via SVD from the full-parameter SFT model. It reaches TSR=18.6% and StepSR=65.3% with ~67M parameters, and serves as the starting checkpoint for RL.
| Metric | Value |
|---|---|
| TSR (Task Success Rate) | 18.6% |
| StepSR (Step Success Rate) | 65.3% |
| Progress | 31.0% |
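The gap between TSR and StepSR is easier to read with a small worked example. The sketch below is not taken from the evaluation code behind this card. It assumes the common offline convention that a task counts as successful only if every one of its steps is judged correct, while StepSR is the fraction of correct steps over all steps. The function name `summarize` and the toy trajectories are hypothetical, and the Progress metric is left out because the card does not define it.

```python
def summarize(trajectories):
    """Return (tsr, step_sr) for a list of per-step correctness lists.

    Assumed definitions (not confirmed by the card):
      - TSR: share of tasks whose steps are all correct
      - StepSR: share of correct steps over all steps
    """
    total_steps = sum(len(steps) for steps in trajectories)
    correct_steps = sum(sum(steps) for steps in trajectories)
    step_sr = correct_steps / total_steps
    tsr = sum(all(steps) for steps in trajectories) / len(trajectories)
    return tsr, step_sr


# Toy data: three tasks, one boolean per step (True = step judged correct).
toy = [
    [True, True, True],         # fully correct task
    [True, False, True],        # one wrong step fails the task
    [True, True, False, True],  # one wrong step fails the task
]
tsr, step_sr = summarize(toy)
print(f"TSR={tsr:.1%}  StepSR={step_sr:.1%}")
```

Under these assumptions a single wrong step fails the whole task, so a model can get most steps right and still have a low TSR, which matches the pattern in the table above.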
Comparison with other methods, ranked by TSR:

| # | Method | TSR | Params |
|---|---|---|---|
| 1 | Full-param SFT step-250 | 22.2% | 7.6B |
| 2 | V15 Cooperative RL step-25 | 20.8% | ~142M |
| 3 | PEFT Cooperative r=128 (SVD) | 18.6% | ~67M |
| 4 | PEFT Standard r=128 (SVD) | 18.1% | ~67M |
| 5 | Base model (zero-shot) | 2.4% | — |
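The "(SVD)" rows refer to initializing the LoRA factors from the full SFT model. The card does not spell out the procedure, so the following is only a minimal NumPy sketch of one common way to do it: take the weight difference between the SFT and base matrices, keep its top-r singular directions, and split the singular values evenly between the two low-rank factors. The function name `svd_lora_init`, the matrix shapes, and the even split of singular values are assumptions for illustration, not the author's implementation.

```python
import numpy as np


def svd_lora_init(w_base, w_sft, rank):
    """Rank-`rank` factors (a, b) such that w_base + b @ a approximates w_sft.

    Assumed scheme: truncated SVD of the SFT weight delta.
    """
    delta = w_sft - w_base                      # what full SFT changed
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    sqrt_s = np.sqrt(s[:rank])
    b = u[:, :rank] * sqrt_s                    # shape (d_out, rank)
    a = sqrt_s[:, None] * vt[:rank]             # shape (rank, d_in)
    return a, b


# Toy check with a delta that is exactly rank 4 (hypothetical sizes).
rng = np.random.default_rng(0)
w_base = rng.standard_normal((16, 12))
true_delta = rng.standard_normal((16, 4)) @ rng.standard_normal((4, 12))
w_sft = w_base + true_delta

a, b = svd_lora_init(w_base, w_sft, rank=4)
print(a.shape, b.shape)
print(np.allclose(w_base + b @ a, w_sft))
```

When the true delta has rank above r, the truncated SVD gives the best rank-r approximation in the Frobenius norm, so the adapter starts close to the SFT model without reproducing it exactly.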
Part of the Cooperative LoRA research for GUI agents.
Base model: Qwen/Qwen2.5-VL-7B-Instruct