Sicherheit & Governance

Governing Customer-Owned Agents: Control-Layer Patterns for Mixed Agent Fleets

A platform engineer's guide to governing mixed agent fleets: how one control plane authorizes your agents and vendor agents alike, without trusting either by default.

Book a demo

Zof Reliability Team · Engineering & Produkt

25. März 2026 · 8 Min. Lesezeit · Aktualisiert 25. März 2026

Zusammenfassung

Your stack is about to run agents you did not write. A network-automation vendor ships one. Your platform team builds three. A security partner's agent wants read-write access to remediate findings. Each arrives with its own credentials, its own idea of what it is allowed to do, and its own opinion of what counts as "done." For a platform engineer in telecom, where a single agent acting on the wrong OSS/BSS path can touch provisioning, billing, or a subscriber-impacting network function, the question is no longer whether to allow autonomous agents. It is how to authorize a fleet you only partly own, without trusting any of it by default. This is a different problem from governing your own automation. You can read your own agents' code, scope their permissions at build time, and trust your CI to gate them. You cannot do any of that for a vendor agent. So the governance has to move out of the agents and into a layer above them: one plane that decides what every agent, yours and theirs, is permitted to do, on what evidence, against which part of the system.

The instinct is to govern each agent where it runs: give the vendor agent its own service account, hope its sandbox holds, and trust its vendor's safety claims.
Before authorization, you need to know who is acting.
Once identity is established, the control plane decides what each agent may do.

Why a mixed fleet breaks per-agent trust

The instinct is to govern each agent where it runs: give the vendor agent its own service account, hope its sandbox holds, and trust its vendor's safety claims. That model fails for structural reasons, and the industry data explains why it fails harder every quarter.

Roughly 41% of codebases are now AI-generated, and around 45% of AI coding tasks introduce a critical flaw or security issue. An agent is, in effect, a fast machine-author of changes, so a fleet of them concentrates that defect rate and points it directly at production. When the author is a third party, you cannot review the code that produced the change, only the change itself. And behavior makes it worse: about 80% of developers bypass policy or guardrails when those guardrails add friction. A vendor's agent has even less incentive than your own engineers to respect a policy that lives in a document rather than in the system.

The deeper issue is that per-agent trust does not compose. Ten agents, each "reasonably safe" in isolation, produce a combined blast radius nobody scoped. Two agents can race on the same resource. A vendor agent can satisfy its own success criteria while violating yours. Trust placed in individual agents cannot be aggregated into trust in the fleet. You need a single point where authority is decided for all of them, on the same terms.

Identity first: no agent acts without an attested identity

Before authorization, you need to know who is acting. In a mixed fleet, "who" is the hardest part, because the actors are non-human, ephemeral, and partly external.

Treat every agent as an untrusted workload that must prove its identity before it does anything. That means no ambient credentials, no shared service accounts, and no long-lived API keys passed to a vendor and forgotten. Each agent, yours or theirs, presents a short-lived, attested credential bound to a specific identity, and the control plane records which agent took which action under which identity. This is the difference between an audit trail that says "the integration account modified the provisioning config" and one that says "vendor agent X, instance 7, acting under a credential issued at 14:02 and valid for nine minutes, proposed this change."

Two non-negotiables for telecom specifically:

Separate proposer from authorizer. An agent that both writes a change and applies it has collapsed the maker and the checker, exactly the separation of duties an auditor expects preserved on a regulated network. Every agent gets a propose-only default. It can plan, generate, and stage; it cannot move a change into a protected environment.
Vendor identities are first-class, not exceptions. The temptation is to wave a trusted partner through with broad access. Resist it. The vendor agent gets the same attested-identity requirement and the same propose-only default as everything else. A partner relationship is a commercial fact, not a security control.

Authorize by capability, not by trust level

Once identity is established, the control plane decides what each agent may do. The wrong axis is "how much do we trust this vendor." The right axis is capability scoped to blast radius: what can this specific agent touch, and what breaks if it is wrong?

This is where a live System Graph becomes the substrate for governance rather than a nice-to-have map. Because it models services, dependencies, and CI/CD as one change-aware graph, the control plane can answer the question that actually predicts risk: does this proposed action touch a node that fans out to a subscriber-facing path, a billing system, or a regulated data store? Capability is then granted against the graph, not against a vendor's reputation.

Make capability grants explicit and narrow:

Scope to a subgraph. A vendor's network-config agent gets capability over the network-function nodes it was bought to manage and nothing else. It cannot propose a change to billing because that path is not in its grant.
Scope to an environment. Bind each agent to where it is allowed to operate, a production-like staging tier, a PCI-segmented subnet, an isolated ERP sandbox, so a misbehaving agent cannot wander out of its lane.
Scope to an action class. Read-only observation, propose-with-validation, and propose-with-escalation are different capabilities. Most agents, especially vendor ones, should start at the lowest that still lets them do their job.

The control plane unifies this. Governance is where capability grants, policy checks, and the audit trail live as configuration that applies identically to a Zof-managed agent and a customer-owned or vendor one. One policy surface, many agents, no per-vendor side deals.

Evidence is the great equalizer

Here is the move that lets you govern agents you cannot inspect: stop judging agents by who built them, and judge every proposed change by the evidence attached to it. A change from your own agent and a change from a vendor's agent reach the gate carrying the same required artifact, what was exercised, what regressed, what the reachability analysis says about exposure, and the gate decides on that, not on pedigree.

This is only credible if the validation is real and change-aware. Coordinated Testing Fleets plan and execute validation against what actually changed and what depends on it, rather than running a static suite that ignores the dependency graph and proves nothing about the specific diff. For security gates, reachability matters: asking whether a flaw sits on a path that is actually reachable in your deployed system, rather than triaging every theoretical finding, can mean 70 to 90% less exploitable exposure. Applied to a mixed fleet, a vendor agent's change that introduces an unreachable issue does not have to block a release, while a reachable one routes straight to a human. You spend scarce review attention on real risk, regardless of which agent surfaced it.

Crucially, this equalizes incentives. A vendor agent cannot declare its own work complete and merge. It produces a proposal with evidence; your control plane decides. Agents propose, humans authorize, and the principle holds whether the proposer is on your payroll or your supplier's.

Failure modes to design against

A mixed-fleet control plane introduces its own ways to fail. Name them so your design accounts for them.

Trust laundering. A vendor agent is granted broad capability "because we trust the vendor," and that grant becomes the hole everything else walks through. Derive capability from the graph and policy, never from a relationship.
Stale graph, wrong scope. If the dependency map drifts, capability scoping misjudges blast radius and an agent reaches further than intended. The graph must be live and continuously reconciled.
Agent collisions. Two agents act on the same resource concurrently. The control plane must serialize or detect conflicting proposals on the same subgraph, not assume agents coordinate themselves.
Evidence the gate never reads. A change shows "tests passed" while validation never exercised the changed path. Read coverage *of the change*, not aggregate suite status.

When a vendor agent must run where your data cannot leave, inside a network operations boundary or a regulated enclave, keep the authority model intact by moving the runtime, not the trust. Edge Runners execute as signed capsules inside your perimeter and emit audit-ready evidence outward, so the data stays put and the proof comes to you. Remediation is the part to govern hardest; letting any agent, yours or a vendor's, fix a production network unsupervised is reckless, which is why governed Remediation Fleets keep apply behind the same authorization as everything else.

### What to do Monday morning

Inventory the actors. List every agent, internal, vendor, partner, that can already write to a protected environment. You will likely find more than you expected.
Issue attested identities. Replace shared accounts and long-lived keys with short-lived, per-agent credentials. No identity, no action.
Scope capability to the graph. Grant each agent a subgraph, an environment, and an action class. Start vendor agents at the narrowest grant that works.
Make the gate read evidence, not pedigree. Require the same validation artifact from every agent and route by reachability-based risk.

The bottom line

KI-Governance Menschliche Autorisierung System Graph Testing Fleets Remediation Fleets

Verwandte Leitfäden

Governed AI remediation

Verwandtes Produkt

Lesen Sie weiter

Sicherheit & Governance

Agents Propose, Humans Authorize: A Reference Architecture for Governed Autonomy

A reference architecture for letting agents act on production safely: the four control surfaces, policy, approval, evidence, attribution, and how they wire into the loop.

Zof Reliability Team16. Juni 20268 Min. Lesezeit

Sicherheit & Governance

More Models Won't Save You: Why AI-Generated Code Needs a Control Layer, Not Smarter Autocomplete

Better code generation can't validate its own output. Why AI-written code needs a governed control layer that maps, tests, and proves every change.

Zof Reliability Team14. Mai 20267 Min. Lesezeit

Sicherheit & Governance

Code Without Provenance: The Real Risk When 41% of Your Codebase Has No Author

When 41% of your codebase has no author, the real risk isn't bugs, it's lost intent. How a System Graph restores the provenance AI-generated code strips away.

Zof Reliability Team5. Mai 20267 Min. Lesezeit

Why a mixed fleet breaks per-agent trust

Identity first: no agent acts without an attested identity

Authorize by capability, not by trust level

Evidence is the great equalizer

Failure modes to design against

The bottom line

Lesen Sie weiter

Agents Propose, Humans Authorize: A Reference Architecture for Governed Autonomy

More Models Won't Save You: Why AI-Generated Code Needs a Control Layer, Not Smarter Autocomplete

Code Without Provenance: The Real Risk When 41% of Your Codebase Has No Author

Eine Oberfläche für Körperhaltung, Operationen und alles, was als nächstes Aufmerksamkeit erfordert.