Part VI · Agents Building Agents · Chapter 24

Governing What You Cannot Watch

The insurer running the fleet of Nemos wants to do the responsible thing, so it assigns a senior claims lead to oversee the agents. She is good and she is conscientious, and on her first morning she opens the queue and finds that the fleet resolved nineteen thousand claims overnight. She cannot read nineteen thousand claims. She cannot read nine hundred. She can read the exceptions the system surfaces, a few dozen, and she reads them well, and the other eighteen-thousand-odd claims resolve under her name without her having seen one of them. She is the human in the loop, and the loop has a hole in it the exact size of the fleet, and no amount of conscientiousness closes it, because the thing that made the fleet worth running was precisely that she would not have to read the claims. That is the condition this chapter is about, and it is not a failure to be staffed away. It is the defining shape of operating agents at scale: the human is structurally out of the per-act loop, permanently, and the question is what stands in for them when no one can watch the acts.

The two chapters before this one arrived at the same wall from different sides. You cannot watch a thousand agents act, so you let agents watch them, and then you cannot watch the watchers either, not act by act, for the same reason. So the discipline that fits the condition is not supervision of acts, which now has no one to perform it, but governance of behavior, which needs no one watching in real time because it is built into the system itself.

The distinction is the hinge of this part and the setup for the next, so it is worth stating plainly, and worth marking that the book is using these two words in a specific, narrow way. The wider world has a rich vocabulary for governance, risk-management frameworks like the NIST AI Risk Management Framework that span design and deployment and organizational process, and none of that is wrong or being waved away. But in this book the two words carry a sharp operational sense, and the line between them is not when the check happens but who is in the loop when it does. Supervision is a human watching, act by act, and stepping in. Governance is the structural apparatus that runs without a human in the per-act loop, the walls that make a bad action unavailable before it happens and the automated checks that catch a bad pattern after it does, both built into the system rather than performed by a person in real time. Supervision needs a human present for each act; governance is the machinery that holds when no human is present for any of them.

That apparatus has a before and an after, and both halves are governance because both run structurally rather than by a human watching. Before the act are the walls: the credential the agent cannot exceed, the action it cannot reach, the boundary built into the call path. After the act are the automated loops: the evaluation that grades the fleet’s behavior against the cases that matter, the monitoring that flags drift, the escalation that fires when an agent hits a condition it was told to hand up. None of those is a human catching a bad action as it comes; all of them are the system checking itself on a cadence no person could sustain. That is the difference that matters at scale. When you can watch every action by hand, supervision is enough. When you cannot, supervision becomes theater, a human nodding at a dashboard summarizing a thousand actions they could not have evaluated, and the only thing actually standing between the fleet and the affected person is the governance apparatus, the walls before and the automated checks after, running without anyone watching in real time. The move from supervising agents to governing a fleet is the move from a human catching bad actions to a structure that prevents and detects them on its own, which is the same move the boundary chapter made for a single agent, now applied to a system moving too fast for any person to watch.

Stated is not enforced, at the level of behavior

The boundary chapter taught the team a rule about a single agent: a safety boundary cannot be a sentence in the prompt, because a prompt is advisory and an agent can reason its way past it. The fleet teaches the same rule one level up, and it bites harder. You can write the orchestrator a beautiful instruction, never let a sub-agent touch production data, never delegate a credential broader than the task requires, never let two agents enter a retry loop without a ceiling. Every one of those is a sentence, and every one of those is advisory, and an orchestrator under pressure to complete its task will reason its way past all three the same way the single agent reasoned its way into deleting the database. The fleet does not become governable because you told it the rules. It becomes governable when the rules are built into what the agents can reach, when the sub-agent physically cannot touch production because the credential it would need does not exist in its scope, when the retry loop hits a ceiling enforced in the call path and not in the orchestrator’s good intentions. Stated is not enforced. It was true for the agent and it is more true for the fleet, because the fleet has more places to put a rule that no one enforces and more speed to violate it before anyone reads the dashboard.

This is why governing the fleet is not a documentation exercise and not a policy a product manager writes and circulates. It is structural, which means it is built, which means it lands where built things land in this book: across the team, with the architect owning the enforcement plane the agents run inside, the product manager and domain expert owning which behaviors must be impossible, the eval owner owning the golden sets that prove the watcher-agents still watch, the supervisor owning whatever human-readable signal survives the compression to the top of the stack. The governance of a fleet is the grid again, at the level of behavior rather than action, and the team that has it is a team that decided, on purpose, what its agents are allowed to be before it let them build each other.

This is the differentiator, stated plainly

Every part of this book until now described work the agentic shift forces on a team whether it wants it or not. This part describes work a team can choose, and the mechanism is simpler than a prediction. As orchestration commoditizes, and it is commoditizing, the frameworks named earlier are turning multi-agent delegation into a feature rather than a research project, the thing that separates teams stops being whether they can run a fleet and becomes whether they can govern one. That is not a forecast with a date on it; it is what happens to any capability once the capability itself is cheap. When everyone can deploy the fleet, the deployment is not the advantage. The governance is: the discipline to run a fleet whose dangerous behaviors are structurally unavailable, whose watcher-agents are validated against more than a golden set, whose autonomy was earned at the fleet level and not scheduled. That discipline is hard, it is mostly unbuilt, the org chart has no seat for it, and a team that builds it operates at a scale the ungoverned teams cannot safely reach. I will not put a number of months on when the rest of the field notices; I will say only that the noticing tends to come after the incident, and the incident comes with a name attached.

And there is a deeper version of the differentiator that points directly at the close of this book. A team that has learned to govern a fleet, to decide what behaviors are allowed and enforce them structurally and prove the enforcement holds, has built something the prior book in this series saw coming but located in the wrong place. That book ended by telling the product manager that an agentic product depends on a supervisory architecture that does not exist yet, and that, for now, the product manager would have to be it. Governing a fleet is the moment that claim breaks all the way open, because no single watcher, human or agent, can hold a system of agents that supervise each other; the thing that holds it is a structure, distributed and built and maintained across the team. The next part is about that structure, what it is made of and who builds it, and it settles the question of where the prior book’s architecture actually lives, a question the whole series has been circling and that the fleet finally forces. The fleet raised it. Governance is the start of the answer. The shape of the rest is the last thing this book has to say.

When the Watchers Are Agents Too The Agent’s Constitution