maxiw
posted an update Nov 19
🤖 Controlling Computers with Small Models 🤖

We just released PTA-1, a fine-tuned Florence-2 model for localizing GUI text and elements. Inference takes ~150 ms on an RTX 4080, which means you can now start building fast on-device computer-use agents!

Model: AskUI/PTA-1
Demo: AskUI/PTA-1

awesome stuff!
