Papers
arxiv:2605.27466

AgensFlow: A Coordination-Policy Substrate for Multi-Agent Systems

Published on May 26
· Submitted by
Nicole Koenigstein
on May 28
Authors:

Abstract

AgensFlow is an open-source framework that treats multi-agent coordination as an online policy-learning problem under partial observability, enabling learned routing to improve coordination-heavy workflows over static approaches.

AI-generated summary

Multi-agent systems built on large language models (LLMs) require many coordination choices that are difficult to fix a priori: which skill protocol to invoke, which agent role should perform a subtask, which model to bind to each role, how roles should interact, when to use retrieval or verification, and when to omit a step entirely. These choices interact with task regime and operational constraints, so static pipelines and one-off model comparisons provide only a limited view of the design space. This paper introduces AgensFlow, an open-source framework that treats multi-agent coordination as an online policy-learning problem under partial observability. The framework makes coordination decisions observable and learnable from repeated trajectories, rather than treating skill, role, model, topology, and evaluation choices as fixed pipeline design. AgensFlow is evaluated on two corpora: distributed-systems incident tasks and security-advisory tasks. The evaluation shows three main results: learned routing reaches a higher-quality operating point than a fixed pipeline baseline on coordination-heavy classes; skip:X isolates topology compression as a meaningful part of the substrate; and warm-started policy graphs can reduce exploration cost while preserving plateau quality. Overall, the results support that learned, auditable routing can improve coordination-heavy multi-agent workflows over static wiring.

Community

Paper submitter

AgensFlow introduces an open-source coordination-policy substrate for multi-agent systems. Instead of hard-coding agent workflows upfront, the framework keeps the underlying models frozen and learns over the coordination graph: model × skill × role × topology, while also learning when to skip skills, models, and decisions.

The policy is based on inspectable state-action statistics rather than a black-box neural controller, which makes the coordination serializable and auditable.

The paper evaluates AgensFlow on distributed-systems incident tasks and security-advisory tasks, showing that learned routing improves over a fixed pipeline baseline on coordination-heavy task classes.

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2605.27466
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2605.27466 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2605.27466 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2605.27466 in a Space README.md to link it from this page.

Collections including this paper 1