エンタープライズ

Remediation by Hand vs. Governed Remediation Fleets: A Cost-Per-Fix Breakdown

A cost-per-fix breakdown of manual remediation versus governed remediation fleets, where agents propose and humans authorize. Built from first principles.

Book a demo

Zof Reliability Team · エンジニアリング & プロダクト

2025年8月12日 · 読了時間 8 分 · 2025年8月12日更新

概要

Most platform teams can tell you their incident count and their mean-time-to-resolve. Almost none can tell you what a single fix actually costs. That blind spot is expensive, because the economics of remediation have quietly inverted: the work of writing a fix is shrinking while the work of *governing* it is growing. If you are still budgeting remediation as if a human writes every patch by hand, you are pricing the wrong line item. This is a cost comparison, not a manifesto. The question is narrow and answerable: what does it cost to resolve a defect by hand, what does it cost to resolve one through a governed remediation fleet where agents propose and humans authorize, and where does the crossover actually sit.

It is a pipeline of distinct labor, and most cost models only count the cheapest stage.
The composition of the work is shifting under the industry's feet.
A governed remediation fleet does not replace the engineer.

What a fix actually costs, broken into parts

A "fix" is not one activity. It is a pipeline of distinct labor, and most cost models only count the cheapest stage. Decompose mean-time-to-resolve into the work it actually contains:

Detect and triage. Confirm the defect is real, reproduce it, decide who owns it. This is mostly waiting and context-switching, not coding.
Locate. Find the responsible code across services, dependencies, and config. On a large system this is the longest single stage, and it scales with system complexity, not defect difficulty.
Author the change. Write the patch. This is the stage everyone pictures when they say "fix," and it is increasingly the smallest.
Validate. Prove the fix works and breaks nothing downstream. This is where cost hides, because incomplete validation is what produces the *next* defect.
Authorize and ship. Review, approve, merge, deploy, watch.

The labor cost of a manual fix is the loaded hourly cost of your engineers multiplied by the wall-clock time across all five stages, plus the carrying cost of the defect while it sits unresolved. The second term is the one finance never sees and engineering always feels: a reachable vulnerability or a latent regression accrues risk every hour it stays open.

The trap is optimizing only stage three. Faster authoring, including AI-assisted authoring, compresses the cheapest stage while doing nothing for locate, validate, or authorize. Worse, it can inflate them.

Why manual remediation is getting more expensive, not less

The composition of the work is shifting under the industry's feet. Roughly 41% of codebases are now AI-generated, and industry research suggests around 45% of AI coding tasks introduce a critical flaw or security issue. Read those two numbers together: the *supply* of code is exploding while the *defect rate per change* is climbing. More changes, each carrying more risk, all funneling into a remediation process that still runs at human pace.

Manual remediation does not scale against that curve for a structural reason. The authoring stage gets cheaper with better tools, but locate, validate, and authorize scale with system complexity and change volume, and those are exactly the dimensions that AI-generated code is inflating. You end up spending less time writing each fix and far more time figuring out what to fix, whether the fix is safe, and who is allowed to ship it.

There is a second tax. When remediation is slow and process-heavy, people route around it. Around 80% of developers admit to bypassing policy or guardrails when those guardrails get in their way. Every bypassed control is a fix that shipped without evidence, which becomes a future incident with its own cost-per-fix. Manual remediation does not just cost what it costs. It generates the rework that creates the next bill. This is the mechanism behind the headline figure that the cost of poor software quality now runs around $2.41 trillion, most of it is not the original defect, it is the remediation and re-remediation it spawns.

Governed remediation fleets: where the money moves

A governed remediation fleet does not replace the engineer. It changes which stages a human spends time on. The principle is fixed: agents propose, humans authorize. Unsupervised autonomous fixing is reckless; the governance, policy, approval, and audit, is the engineering work, not a wrapper around it.

Here is how the cost moves stage by stage:

Locate collapses. A live System Graph maps services, dependencies, and CI/CD into one change-aware model, so the fleet starts from the dependency context instead of grepping for it. The longest manual stage shrinks the most.
Authoring is proposed, not performed by hand. The fleet drafts the change. Your engineer's time moves from typing to judging.
Validation is built in, not bolted on. Coordinated Testing Fleets exercise the changed paths and what depends on them, so the proposed fix arrives carrying evidence rather than a hope.
Authorization concentrates. Instead of reviewing everything at the same depth, the human spends attention only where blast radius warrants it. Low-risk, fully-validated fixes are authorized fast; the dangerous minority gets real scrutiny.

The cost structure inverts. Manual remediation is mostly variable cost: every fix consumes a near-constant slice of engineer time across all five stages. Governed remediation shifts the heavy stages, locate and validate, toward fixed, amortized infrastructure cost, leaving humans a smaller, higher-leverage variable cost: the authorization decision and the genuinely hard fixes the fleet escalates.

### The cost-per-fix comparison

Stage	Remediation by hand	Governed remediation fleet
Detect / triage	Engineer hours, high context-switch tax	Fleet triages; human confirms
Locate	Longest stage; scales with complexity	Graph-driven; largely amortized
Author	Human writes every patch	Agent proposes; human edits the hard cases
Validate	Often skipped or partial	Change-aware evidence attached by default
Authorize / ship	Uniform review depth	Risk-tiered; attention where blast radius is real
Dominant cost type	Variable, per-fix	Fixed infra + thin variable human judgment
Failure mode	Bypassed controls, rework	Bad policy or stale graph, not bad code shipping silently

The comparison is not "humans versus no humans." It is "humans on every stage of every fix" versus "humans on the decisions that actually carry risk." The second is cheaper per fix *and* safer, because the safety lives in evidence and policy rather than in a tired reviewer's eyeball.

Where the crossover sits, and where manual still wins

Governed fleets are not free, and a serious cost model should say where manual remediation is still the right call.

Manual remediation wins when volume is low and the system is small enough that locate is cheap. If you ship a handful of changes a week against a system one person holds in their head, the fixed cost of standing up governed remediation will not pay back. The crossover arrives with scale and complexity: high change volume, many services, regulated surfaces, and a meaningful share of AI-generated code where the per-change defect rate is high. That is precisely the environment where manual locate-and-validate costs explode.

Two economic levers move the crossover earlier than most teams expect:

Reachability-based prioritization. Triaging by whether a flaw sits on an actually reachable path can mean 70 to 90% less exploitable exposure to remediate. You are not fixing fewer real problems; you are not paying to fix theoretical ones. That alone reprices your remediation backlog.
Rework avoided. Every fix that ships with real validation evidence is a future incident that never happens. The governed model's return is dominated by the rework it prevents, not the keystrokes it saves.

A caution: governed remediation introduces its own failure modes, and they are governance failures, not coding failures. A stale System Graph misjudges blast radius. A miscalibrated policy either over-escalates (recreating the bottleneck) or under-escalates (authorizing risk it should have paused). These are real costs, but they are *legible* costs, they live in Governance as policy and audit you can inspect and tune, not in an undocumented patch a developer pushed past the guardrail at 2 a.m.

What to do Monday morning

You cannot compare two models you have not measured. Start with instrumentation, not procurement.

Decompose your MTTR. For two weeks, tag each resolved defect with time spent in detect, locate, author, validate, and authorize. Most teams discover locate and validate dwarf authoring.
Price the bypass. Count fixes that shipped without validation evidence and trace which incidents traced back to them. That is your rework line item, and it is your real cost-per-fix.
Pick one high-volume, well-bounded surface. Run governed remediation on it where the fleet proposes and a human authorizes, and compare cost-per-fix against your manual baseline.
Tier authorization by blast radius, so human attention stops getting spent on changes that never needed it.

The bottom line

リリース準備状況 QA System Graph テスティングフリート修復フリート

続きを読む

エンタープライズ

Activity vs. Outcome: Why Your Reliability Metrics Are Measuring the Wrong Thing

Test counts and run volumes are activity theater. Here's why only outcome metrics, escaped defects and proven-safe releases, justify reliability investment.

Zof Reliability Team2026年6月17日読了時間 7 分

エンタープライズ

Reliability ROI for E-commerce: Measuring Confidence on Every Checkout Release

A case-study model for pricing avoided revenue loss on every checkout, payments, and inventory release, so product managers can defend reliability as ROI.

Zof Reliability Team2026年6月10日読了時間 7 分

エンタープライズ

Velocity Doesn't Kill Quality, Lack of Visibility Does

The speed-vs-quality tradeoff is a measurement failure, not a law of physics. Here's why full traceability across the reliability loop dissolves it.

Zof Reliability Team2026年6月9日読了時間 7 分

What a fix actually costs, broken into parts

Why manual remediation is getting more expensive, not less

Governed remediation fleets: where the money moves

Where the crossover sits, and where manual still wins

What to do Monday morning

The bottom line

続きを読む

Activity vs. Outcome: Why Your Reliability Metrics Are Measuring the Wrong Thing

Reliability ROI for E-commerce: Measuring Confidence on Every Checkout Release

Velocity Doesn't Kill Quality, Lack of Visibility Does

姿勢、操作、次に注意が必要なことを 1 つの面で確認できます。