Sécurité et gouvernance

Separation of Duties for AI Agents: Who Proposes, Who Authorizes, Who Is Accountable

A CISO's framework for applying separation of duties to AI agents: why the proposing agent can never authorize its own change, and who stays accountable.

Book a demo

Équipe Fiabilité Zof · Ingénierie et produit

23 décembre 2025 · 8 min de lecture · Mis à jour le 23 décembre 2025

Résumé

Your auditors have spent decades enforcing one rule on the humans in your change pipeline: the person who initiates a transaction cannot be the person who approves it. Now autonomous agents are writing, testing, and proposing changes to the systems that price policies and pay claims, and most agent deployments quietly collapse that rule. The agent that proposes the fix is the same process that decides the fix is good and pushes it. For a CISO in insurance, that is not a productivity gain. It is an unsegregated duty waiting to surface in an exam. Separation of duties (SoD) is roughly fifty years old as a formal control, older than that as banking practice. It survived this long because it does not depend on anyone being trustworthy. It assumes the actor might be wrong or compromised and structures authority so a single bad actor cannot complete a sensitive action alone. That assumption transfers cleanly to agents, which are capable, fast, and reliably wrong a meaningful fraction of the time. This is a framework for applying SoD to AI agents without either neutering the automation or pretending oversight has been removed.

SoD is usually taught as two roles, maker and checker, the "four-eyes" principle.
The classic argument for SoD is collusion and fraud.
A checker that only reads what the maker handed it is theater.

Three roles the agent era keeps collapsing

SoD is usually taught as two roles, maker and checker, the "four-eyes" principle. Agent systems force a third role into the open, because automation makes it easy to lose track of who is answerable when something goes wrong. The clean model has three distinct seats:

Who proposes. The actor that initiates and constructs a change: writes the code, generates the test, drafts the remediation, assembles the evidence. This is the maker. In a modern stack it is increasingly an agent.
Who authorizes. The actor that decides the proposal may proceed into a protected environment. This is the checker, and it holds the authority the proposer does not. The authority is delegated and bounded by policy.
Who is accountable. The named human or role who answers for the outcome to an examiner, a board, or a regulator. Accountability does not move just because execution was automated.

The failure mode is collapsing any two of these into one actor. The most dangerous collapse is proposer-equals-authorizer: an agent that applies its own change. The quietest and most common collapse is authorizer-equals-accountable disappearing entirely, where automation runs and no human can say who owns the result. For an insurer, where a mispriced endorsement or a broken claims rule has direct policyholder and solvency consequences, the accountable seat can never be vacant or assigned to a system.

Why the proposing agent must never authorize itself

The classic argument for SoD is collusion and fraud. The agent-era argument is narrower and harder to wave away: the proposer's judgment about its own work is not independent evidence. An agent that generated a change and then certifies it passed validation is grading its own exam with an answer key it wrote.

The industry data makes this concrete rather than theoretical. Roughly 41% of codebases are now AI-generated, and around 45% of AI coding tasks introduce a critical flaw or security issue. The cost of poor software quality already sits near $2.41 trillion. You are not weighing whether to admit capable-but-fallible actors into your pipeline. They are already there, producing a defect rate you cannot prompt away. A better model lowers that rate; it does not give you a control. You cannot audit a probability, and you cannot let the thing that might be wrong be the thing that decides it was right.

There is also a security dimension specific to agents that SoD never had to handle for humans. A compromised or manipulated agent can produce a confident, well-formatted proposal complete with plausible-looking evidence. If the authorizing function trusts artifacts the proposer generated, a single compromised proposer defeats the whole control. That is why the checker must validate against an independent source of truth, not against the proposer's own claims.

Independent authorization needs independent evidence

A checker that only reads what the maker handed it is theater. Real separation requires the authorizing path to derive its facts from somewhere the proposer cannot author.

This is where a live System Graph becomes a control, not a convenience. Because it maps services, dependencies, and CI/CD into one change-aware model, the authorizing function can independently answer the question that actually governs risk: what does this change reach? A change to a quoting service that fans out to the binding path is a different risk class than the same diff on an internal reporting job, and the graph establishes that without trusting the proposer's self-description of blast radius.

Validation has to be independent in the same way. Coordinated Testing Fleets plan and execute validation that is aware of what changed and what depends on it, then emit the evidence the gate reads, which paths were exercised, what regressed, what reachability analysis says about exposure. That last signal matters for security authorization specifically: reachability-based prioritization, asking whether a flaw sits on a path that is actually reachable in your deployed system, can mean 70 to 90% less exploitable exposure to triage. The authorizer routes a reachable defect to a human and lets an unreachable one through on policy. Either way, the decision rests on evidence the proposer did not manufacture.

The principle underneath is the one your auditors already recognize: agents propose, humans authorize. The control layer can map, validate, and stage the change. It does not get to authorize the dangerous ones on its own behalf. Governance, policy, approval, and audit, is the engineering, not a wrapper added after the agent ships.

The accountability seat: encode it, don't assume it

Authorization decides whether a change proceeds. Accountability decides who answers for it afterward. Insurers learn the hard way that these are not the same when an examiner asks who approved a rule that underpaid a class of claims, and the honest answer is "an automation pushed it and no one is named."

Accountability has to be encoded as policy, not left to org-chart inference:

Every protected change carries a named authorizer by role, and that role is provably distinct from whatever proposed the change. The proposer cannot appear in the approver set.
The accountable owner is recorded at decision time, bound to the proposal, the evidence, and the System Graph context the decision rested on, one linked, immutable artifact rather than a reconstruction.
The trail proves the negative. An auditor's real test is not "do you have logs," it is "can you prove this specific change was authorized by someone permitted to authorize it, on evidence that existed before approval, and that the control was not bypassed."

That last point is where most programs fail. Roughly 80% of developers admit to bypassing policy or guardrails when they add friction. A separation of duties that lives in a wiki or a change-advisory meeting gets routed around at exactly the moment it matters. The control has to be a property of the system, where it cannot be skipped, with the audit trail as a byproduct of how the pipeline runs.

For workloads that cannot leave your perimeter, and in insurance, policyholder and claims data rarely can, the authority model has to hold inside your boundary. Edge Runners execute as signed capsules inside a secure enclave and emit audit-ready evidence outward, so residency and separation of duties stop being a tradeoff. The data stays; the proof leaves.

A note on the hardest case: remediation. Letting agents fix code unsupervised is reckless precisely because it is the step where proposing and authorizing are most tempting to merge. Governed Remediation Fleets stage the fix and the evidence but never self-authorize a change to a regulated or revenue-critical path.

What to do Monday morning

You do not need a platform rebuild to restore separation. Start with mapping, then enforcement.

Find every place an agent can write to a protected environment today. This is your unsegregated-duty inventory. It is usually larger than expected.
Build a three-seat RACI for changes. For each sensitive path, name who proposes, who authorizes, and who is accountable, and confirm no actor holds two seats.
Set agents to propose-only by default, and require an explicit, role-checked human authorization for any reachable, regulated, or revenue-critical change.
Make the authorizer read independent evidence, graph-derived blast radius and change-aware validation, never the proposer's own attestations.
Bind the accountable owner into the record so a future exam answers in minutes, not weeks.

The bottom line

Gouvernance de l'IA Autorisation humaine System Graph Flottes de test Flottes de remédiation

Guides associés

Governed AI remediation

Produit associé

Continuer la lecture

Sécurité et gouvernance

Agents Propose, Humans Authorize: A Reference Architecture for Governed Autonomy

A reference architecture for letting agents act on production safely: the four control surfaces, policy, approval, evidence, attribution, and how they wire into the loop.

Équipe Fiabilité Zof16 juin 20268 min de lecture

Sécurité et gouvernance

More Models Won't Save You: Why AI-Generated Code Needs a Control Layer, Not Smarter Autocomplete

Better code generation can't validate its own output. Why AI-written code needs a governed control layer that maps, tests, and proves every change.

Équipe Fiabilité Zof14 mai 20267 min de lecture

Sécurité et gouvernance

Code Without Provenance: The Real Risk When 41% of Your Codebase Has No Author

When 41% of your codebase has no author, the real risk isn't bugs, it's lost intent. How a System Graph restores the provenance AI-generated code strips away.

Équipe Fiabilité Zof5 mai 20267 min de lecture

Three roles the agent era keeps collapsing

Why the proposing agent must never authorize itself

Independent authorization needs independent evidence

The accountability seat: encode it, don't assume it

What to do Monday morning

The bottom line

Continuer la lecture

Agents Propose, Humans Authorize: A Reference Architecture for Governed Autonomy

More Models Won't Save You: Why AI-Generated Code Needs a Control Layer, Not Smarter Autocomplete

Code Without Provenance: The Real Risk When 41% of Your Codebase Has No Author

Une surface pour la posture, les opérations et ce qui nécessite une attention particulière.