Security and Governance

Why 80% of Developers Bypass Policy, and What That Means When the Developer Is an Agent

~80% of developers bypass policy. When the developer is an agent, advisory governance becomes a threat model. Why control must move to the action layer.

Zof Reliability Team · Engineering & Product

November 25, 2025 · 7 min read · Updated November 25, 2025

Overview

Roughly 80% of developers bypass policy and guardrails. For most of software history a security leader could treat that number as a tax: real, expensive, but bounded by the fact that a person was on the other end of every decision. That assumption is now expiring. When the developer is an agent, every property that made bypass survivable (accountability, deterrence, human-paced throughput) breaks at once. This is an argument that policy not enforced at the action layer was always theater, and that agents are what turn the theater into an incident.

The 80% was never a discipline problem

Read the 80% bypass figure correctly and it stops being an indictment of engineers. It is a measurement of where you put your rules. A control that lives in a Confluence page, a PR-template checkbox, or a quarterly security training is advisory. Advisory rules depend on memory, goodwill, and slack in the schedule, and none of those survive contact with a deadline.

The mechanics are unglamorous. A required threat-model document that costs an afternoon loses to a feature that ships today. A "please run the full suite before merging" note loses to a green-enough local run on the Friday a sprint closes. The engineer is not being reckless. The rule was simply placed somewhere other than where the decision happens, so the decision happens without it.

There is a clean test for any control you own as a security leader. Where does it physically sit relative to the action it governs? If the answer is "in a document the engineer has to remember to open," you do not have a control. You have a suggestion with a paper trail, and a paper trail is something you read at the postmortem, not something that prevents one.

Why human bypass was tolerable, and why that ends with agents

The reason advisory governance held up for as long as it did is that human bypass came with built-in brakes. A skipped review or an unrun suite was absorbed by a person who roughly understood the blast radius of their own change. There was an accountable party. There was deterrence, because consequences attach to people. And there was a natural throttle: a human can only skip so many controls per day because a human can only produce so much code per day.

Every one of those brakes is gone when the developer is an agent.

There is no accountable party in the human sense. An agent does not feel deadline guilt, does not read your wiki, and cannot be deterred by the prospect of a difficult conversation. The control either executes or it does not.
The throttle is gone. Industry research now puts AI-generated code at roughly 41% of codebases, and roughly 45% of AI coding tasks introduce a critical flaw or security issue. A large and growing share of what reaches production is authored at machine speed by something that produces volume and ships defects by default.
The bypass becomes silent and structural. A human who skips a gate leaves a footprint a colleague might notice. An agent inheriting the same permissive environment skips at scale, continuously, with nothing watching the act of skipping itself.

Put those two figures together and the picture is stark. You have an author producing a near-majority of your code, immune to every soft control your governance program relies on, introducing a critical flaw in close to half of its tasks. The cost of poor software quality is already estimated near $2.41 trillion. You do not close that gap by asking AI-assisted developers to be more diligent about a process the machine cannot even see.

Advisory policy is now a threat model, not a hygiene gap

This is the reframe a CISO has to make. Unenforced policy used to be a hygiene problem, something you cleaned up over time with better culture and better tooling. With agents in the authoring loop it becomes an active part of your threat model, because the gap between written policy and enforced policy is now a gap an autonomous process operates inside continuously.

Three properties of agents make this concrete:

Identity. Your access model was built around named humans. An agent acting with a service identity, or worse, a borrowed human credential, inherits permissions your policy assumed a person would exercise sparingly and with judgment.
Blast radius opacity. A human author usually knows that a change touches the auth service or the payments path. An agent does not, unless something tells it. Policy that says "changes to auth require a security reviewer" is meaningless if nothing computes that a given change reaches auth.
Non-determinism. The same prompt can produce different code twice. Governance that assumes stable, reviewable, repeatable change is governing a thing that no longer behaves that way.

None of these are arguments against agents. They are arguments against running agents inside an environment where policy is documentation. A serious enterprise does not want a smarter agent operating in an ungoverned system. It wants a governed system in which any agent, capable or not, must act through the same enforced policy, approval, and evidence path.

Control has to move to the action layer

If the policy decision happens at the action boundary, the agent's indifference to your wiki stops mattering, because the wiki is no longer where enforcement lives. This is the core of the control-layer thesis: AI is missing a control layer, not more models. The fix for machine-speed bypass is not a stricter document. It is to make the governed path the only path code can take to production, for humans and agents alike.

That requires a few things working together, and the order matters.

First, the system needs to know what an action touches before it allows the action. A live System Graph that maps services, dependencies, and CI/CD is what lets policy be change-aware instead of blanket. It is what binds "this change reaches the payments path" to "therefore this policy applies." Enforcement is downstream of system understanding; you cannot govern what you cannot map.
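As a minimal sketch of what change-aware can mean in practice, assume the graph can be reduced to a mapping from each service to the services that depend on it. The service names, the `DEPENDENTS` map, and the `POLICIES` table below are hypothetical and are not a description of any product's data model. The point is only that the policy lookup is driven by a computed blast radius rather than by what the author happens to remember.

```python
from collections import deque

# Hypothetical graph: service -> services that depend on it (downstream consumers).
DEPENDENTS = {
    "token-lib": ["auth"],
    "auth": ["payments", "profile"],
    "payments": [],
    "profile": [],
    "docs-site": [],
}

# Hypothetical policy table: sensitive service -> control required when it is reached.
POLICIES = {
    "auth": "security-review",
    "payments": "payments-owner-approval",
}


def blast_radius(changed):
    """Return the changed services plus everything downstream of them (BFS)."""
    seen = set(changed)
    queue = deque(changed)
    while queue:
        service = queue.popleft()
        for dependent in DEPENDENTS.get(service, []):
            if dependent not in seen:
                seen.add(dependent)
                queue.append(dependent)
    return seen


def required_controls(changed):
    """Bind 'this change reaches X' to 'therefore this policy applies'."""
    reached = blast_radius(changed)
    return sorted(POLICIES[s] for s in reached if s in POLICIES)
```

Under these assumptions, a change to `token-lib` picks up both controls because it reaches `auth` and, through it, `payments`, while a change to `docs-site` picks up none. That asymmetry is what makes the enforcement proportionate instead of blanket.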

Second, validation has to track the system as it moves. Static scripts rot the moment the architecture shifts, and a rotting gate is a bypassed gate, the very friction that drove the 80% figure to begin with. Testing Fleets plan, execute, and maintain validation as systems evolve, so coverage stays proportionate rather than decaying into noise people learn to ignore.

Third, there has to be an explicit authorization boundary. The governing principle is that agents propose and humans authorize. When a change falls outside policy, the control layer does not silently fix it and does not silently let it pass. It produces a proposal with evidence and routes the decision to someone with the authority to make it. This matters most on the hardest surface, remediation, where unsupervised autonomous fixing is reckless and governed remediation is the real engineering work. Separation of proposer and authorizer is a fifty-year-old security control; the agent era makes it load-bearing rather than retiring it.
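The boundary described above can be sketched in a few lines. This is an illustration under stated assumptions, not a reference implementation: the `Proposal` fields, the `Decision` values, and the `approver_is_human` flag are all hypothetical names, and a real system would resolve identity from its own directory rather than trust a boolean.

```python
from dataclasses import dataclass, field
from enum import Enum


class Decision(Enum):
    ALLOW = "allow"
    NEEDS_HUMAN = "needs_human"


@dataclass
class Proposal:
    change_id: str
    proposer: str            # identity (agent or human) that authored the change
    required_controls: list  # controls the policy check attached to this change
    evidence: dict = field(default_factory=dict)  # e.g. validation results


def evaluate(proposal):
    """In-policy changes flow; anything else is routed, never silently fixed or passed."""
    if not proposal.required_controls:
        return Decision.ALLOW
    return Decision.NEEDS_HUMAN


def authorize(proposal, approver, approver_is_human):
    """Separation of duties: the authorizer must be a human other than the proposer."""
    if not approver_is_human:
        raise PermissionError("only a human may authorize")
    if approver == proposal.proposer:
        raise PermissionError("proposer cannot authorize their own change")
    return {"change_id": proposal.change_id, "authorized_by": approver}
```

The design choice worth noting is that `evaluate` has only two outcomes. There is no branch in which the control layer repairs the change on its own, which keeps remediation on the same propose-then-authorize path as everything else.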

Fourth, the loop has to emit audit-ready evidence by default: what was proposed, by which agent, under what policy, on what validation, authorized by whom. For regulated or sensitive environments, Edge Runners let this execute inside your own boundary as signed capsules, so the control layer governs without your code or topology leaving your control. That is governance as an architectural property, not a clause in an MSA.
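To make the five evidence fields concrete, here is a small sketch of a tamper-evident record. The capsule format itself is not specified in this article, so the example substitutes a plain HMAC-SHA256 over canonical JSON from the Python standard library; the function names and field names are assumptions chosen for illustration only.

```python
import hashlib
import hmac
import json


def build_evidence(proposed, agent, policy, validation, authorized_by):
    """Collect the five facts the loop should emit for every change."""
    return {
        "proposed": proposed,
        "agent": agent,
        "policy": policy,
        "validation": validation,
        "authorized_by": authorized_by,
    }


def sign(record, key):
    """HMAC-SHA256 over a canonical JSON encoding (illustrative, not a capsule format)."""
    payload = json.dumps(record, sort_keys=True, separators=(",", ":")).encode("utf-8")
    return hmac.new(key, payload, hashlib.sha256).hexdigest()


def verify(record, signature, key):
    """Recompute the signature and compare in constant time."""
    return hmac.compare_digest(sign(record, key), signature)
```

Sorting the keys before signing matters here: two records with the same content must serialize identically, or verification would fail on a harmless reordering. Any later edit to a field, such as swapping the name in `authorized_by`, makes `verify` return `False`.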

What to do Monday morning

You do not need a platform migration to start closing the gap. You need to find where your governance is advisory and make one piece of it executable.

Inventory your agent identities. List the service accounts and credentials any AI-assisted tool can act through, and confirm their permissions assume an agent, not a careful human.
Measure the bypass; do not assume it. Pull the override rate on your most "required" gate. That rate is your real policy, regardless of what the wiki says.
Make one control change-aware. Replace one blanket gate with a check that runs narrowly on what actually changed, using real blast radius. Proportionate enforcement is what earns compliance instead of evasion.
Write down the authorization boundary. Decide explicitly which classes of change can flow automatically and which require a human to authorize. Ambiguity here produces both bottlenecks and bypasses.
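For the measurement step in the list above, a rough sketch of an override-rate calculation follows. It assumes you can export gate events as records with a `gate` name and an `outcome` of `passed`, `blocked`, or `overridden`; that event shape is hypothetical, and your CI system's export will look different.

```python
from collections import Counter


def override_rate(events, gate):
    """Share of runs of `gate` that ended in an override rather than a pass or a block."""
    outcomes = Counter(e["outcome"] for e in events if e["gate"] == gate)
    total = sum(outcomes.values())
    if total == 0:
        return None  # no data is not the same as zero bypass
    return outcomes["overridden"] / total
```

Returning `None` for a gate with no recorded runs is deliberate. A gate that never fires should show up as unmeasured, not as a gate nobody bypasses.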

The bottom line

AI Governance · Human Authorization · System Graph · Testing Fleets · Remediation Fleets
