Update README.md
Browse files
README.md
CHANGED
@@ -185,4 +185,19 @@ print(output_text)
|
|
185 |
# {'action': 'ENTER', 'value': None, 'position': None}
|
186 |
```
|
187 |
|
188 |
-
data:image/s3,"s3://crabby-images/aed57/aed579d5170fef493a1c003c5dd655761d7df57d" alt="download"
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
185 |
# {'action': 'ENTER', 'value': None, 'position': None}
|
186 |
```
|
187 |
|
188 |
+
data:image/s3,"s3://crabby-images/aed57/aed579d5170fef493a1c003c5dd655761d7df57d" alt="download"
|
189 |
+
|
190 |
+
|
191 |
+
If you find our work helpful, please consider citing our paper.
|
192 |
+
|
193 |
+
```
|
194 |
+
@misc{lin2024showui,
|
195 |
+
title={ShowUI: One Vision-Language-Action Model for GUI Visual Agent},
|
196 |
+
author={Kevin Qinghong Lin and Linjie Li and Difei Gao and Zhengyuan Yang and Shiwei Wu and Zechen Bai and Weixian Lei and Lijuan Wang and Mike Zheng Shou},
|
197 |
+
year={2024},
|
198 |
+
eprint={2411.17465},
|
199 |
+
archivePrefix={arXiv},
|
200 |
+
primaryClass={cs.CV},
|
201 |
+
url={https://arxiv.org/abs/2411.17465},
|
202 |
+
}
|
203 |
+
```
|