vladbogo 
posted an update Feb 13
OS-Copilot is a new framework for creating computer agents such as FRIDAY. This framework enables agents to interact seamlessly with your operating system, handling tasks like file management, multimedia editing, and more.

The system has three components:
* Planner: It takes complex user requests and breaks them down into manageable subtasks for efficient execution.
* Configurator: It prepares tasks for execution based on your preferences and available commands using a memory mechanism.
* Actor: It executes the tasks and learns from feedback, ensuring continuous improvement.
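As a rough illustration of how these three components could fit together, here is a minimal toy sketch. It is not the OS-Copilot or FRIDAY API: every class, function, and the `memory` dictionary below is a hypothetical stand-in, the planner just splits on the word "then", and the actor only pretends to run commands.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Subtask:
    name: str
    command: str = ""
    done: bool = False


def plan(request: str) -> list[Subtask]:
    """Hypothetical planner: split a request into subtasks on ' then '."""
    return [Subtask(name=p.strip()) for p in request.split(" then ") if p.strip()]


def configure(task: Subtask, memory: dict[str, str]) -> Subtask:
    """Hypothetical configurator: pick a command from memory, else a placeholder."""
    task.command = memory.get(task.name, f"echo 'no known tool for: {task.name}'")
    return task


def act(task: Subtask, memory: dict[str, str]) -> Subtask:
    """Hypothetical actor: pretend to run the command, then record it as feedback."""
    task.done = True  # a real agent would run the command and check the result
    memory[task.name] = task.command  # remember what was used for next time
    return task


def run_request(request: str, memory: dict[str, str] | None = None) -> list[Subtask]:
    """Chain planner -> configurator -> actor over one user request."""
    if memory is None:
        memory = {}
    return [act(configure(task, memory), memory) for task in plan(request)]


if __name__ == "__main__":
    memory = {"list files": "ls"}
    for task in run_request("list files then rename photo", memory):
        print(task.name, "->", task.command)
```

The shared `memory` dictionary is the only piece that persists between requests in this sketch, which is one simple way to picture the "learns from feedback" step; the actual framework may organize this quite differently.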

FRIDAY outperforms other methods on GAIA, a benchmark for general AI assistants. Answering GAIA questions requires skills such as numerical calculation, web browsing, and video and speech processing, among others.

Resources:
* Paper: OS-Copilot: Towards Generalist Computer Agents with Self-Improvement (2402.07456)
* Project GitHub: https://github.com/OS-Copilot/FRIDAY
* Project page: https://os-copilot.github.io/

Congrats to the authors Wu, Zhiyong et al. for their work!