Does this replace our existing CI/CD, Jira, and Slack?

No. The System Graph ingests metadata from the tools you already run and Zof integrates with existing CI/CD, Jira, and Slack rather than replacing them. The graph adds the change-to-impact relationships your current pipeline does not model, and pushes evidence and approvals back through the channels your teams already use.

Isn't attributing changes to developers and AI tools just surveillance?

The intent is calibration of systems and tools, not individual scoring. Attribution links a change to its author and authoring tool so you can see whether a given source of change tends to escape defects, which is a guardrail and procurement decision. Humans set policy through the governance layer, and the audit trail records who authorized what, not a productivity ranking.

Which metrics should we report to leadership instead of coverage?

Lead with time-to-reproduce and escaped-defect rate by service, plus the share of checks justified by change impact rather than a fixed suite. These measure whether the loop is actually getting safer. Raw test counts and total coverage measure activity and can rise while real risk is untouched.

Where do we start without a multi-quarter rollout?

Model your highest-risk workflows in the System Graph, then run targeted validation on one service so each merge carries its own impact map and evidence. Compare escaped-defect rate and time-to-reproduce against your current baseline over six to eight weeks before expanding surfaces and policies under governance review.

Operaciones de fiabilidad

Velocity Doesn't Kill Quality. Lack of Visibility Does.

The speed-versus-quality tradeoff is a measurement error. The real variable is visibility into change impact.

Book a demo

Equipo de Fiabilidad de Zof · Ingeniería y producto

3 de junio de 2026 · 13 min de lectura · Actualizado 16 de junio de 2026

The tradeoff that was never real

The default story in most engineering organizations is that speed and quality trade against each other. Ship faster and you accept more defects. Slow down and you buy back stability. Leaders are asked to choose a point on that curve and defend it in quarterly reviews.

The curve is an artifact of poor instrumentation, not a law of software. Teams feel the tradeoff because they cannot see what a fast change actually touches. The cost of velocity is invisible until production surfaces it, so velocity gets blamed for a failure of visibility.

Reframed correctly, the question is not how fast to go. It is how clearly the organization can see the consequences of going fast, in time to act on them.

What going fast actually erodes

Velocity does not directly create bugs. It erodes the shared mental model of how the system fits together. When a service ships ten times a day, no single engineer holds an accurate picture of which workflows depend on it, which contracts it satisfies, or which incidents it has caused before.

That eroded model is the real failure mode. A change looks local to its author and turns out to be global in effect. The defect was authored in confidence precisely because the blast radius was unknowable at the moment of the merge.

With AI-assisted development this is sharper. Zof's research puts AI-generated code at roughly 41% of codebases, which means a growing share of changes arrive without a human who can explain, from memory, everything they touch.

A green pipeline is a verdict, not visibility

Most teams already have CI, and most CI emits a single bit: pass or fail. That bit is a verdict on a suite, not an account of a change. It cannot tell you which workflows this merge put at risk, why each check was selected, or what was deliberately left uncovered.

A passing run on an unrelated suite feels like safety and is not. A failing run on a flaky test feels like risk and is not. In both cases the signal is opaque about the thing that matters: the relationship between this specific change and the parts of the system that depend on it.

Pass/fail tells you the suite is happy. It does not tell you the change is safe. Those are different questions, and only one of them is the one your customers feel.
— Zof engineering

The six dimensions of real traceability

Visibility is not one fact. It is a chain that connects a change to its consequences and to the people and tools accountable for it. Each link answers a question a skeptical reviewer would ask, and the chain only holds if every link is present.

What full traceability records for every change

What changed: the diff, the services and contracts it modifies, the data it moves.
What it impacts: the downstream workflows, dependencies, and environments reached through the System Graph.
Which checks ran: the targeted validation selected for this change, not a fixed suite.
Why they ran: the impact and risk rationale that justified each check.
The evidence: traces, artifacts, and failure signatures attached to the change itself.
Who and what authored it: the developer, the reviewer, and the AI tool that generated or assisted the code.

The last dimension is the one teams skip and the one that matters most over time. If you cannot attribute a change to an author and an authoring tool, you cannot learn which sources of change help or hurt reliability.

Change impact, made visible

The unit of visibility is the fan-out of a single change. A one-line edit to a service is never one line of consequence. It propagates along dependency edges to other services, into the user and batch workflows that cross those services, and against the history of incidents that touched the same nodes.

Change impact fan-out for one merge

Change in service A (auth token refresh)
  ├─ dependent service B  -> targeted contract + API checks
  ├─ workflow: checkout    -> UI + integration fleet run
  ├─ workflow: login       -> session + regression checks
  └─ prior incident #1184  -> replay reproduction case
          |
          v
   evidence + owner + authoring tool attached to the change

The same fan-out that drives targeted validation also defines what visibility must cover.

The System Graph as the visibility layer

The data that produces this fan-out has to live somewhere durable. In Zof that place is the System Graph: a living map of services, workflows, dependencies, tests, incidents, environments, and change impact, assembled from the systems you already run rather than a snapshot that rots overnight.

The graph is what turns full regression into targeted validation with a rationale. When a change lands, it computes the affected nodes and hands that scope to Testing Fleets, which run the checks that matter and write evidence back to the nodes they touched. We make the deeper case for this in the System Graph as the foundation of software reliability.

Opaque CI signal versus graph-backed visibility

Two ways to answer "is this change safe to ship?"

Question a reviewer asks	Opaque CI signal	Graph-backed visibility
What did this change touch?	Unknown; the suite is change-agnostic	Impacted services and workflows, computed per change
Why did these checks run?	They always run; no per-change rationale	Selected from impact and risk, with reasons recorded
What was left uncovered?	Implicit and undocumented	Explicit open risks listed against the change
Where is the proof?	Logs buried in a CI job	Traces and artifacts attached to the change itself
Who and what authored it?	Not correlated with outcomes	Developer and AI tool linked to reliability signal
Is it release-ready?	"The build is green"	Evidence that critical workflows are validated

Accountability without blame

Once authorship is part of the record, a sensitive capability appears: you can see which developers and which AI tools tend to help or hurt quality. This is easy to weaponize and that would be a mistake. The goal is calibration, not a leaderboard.

Used well, attribution is a feedback loop. If one code-generation tool ships changes that escape to production at a higher rate, that is a procurement and guardrail decision, not a performance review. Zof's research finds roughly 45% of AI coding tasks introduce a critical security flaw and around 80% of developers bypass security policy; you cannot manage either pattern if you cannot see it.

Accountability lives at the level of systems and tools, with humans setting policy. That is consistent with how the rest of the platform works: agents propose, humans authorize, and the governance layer keeps the audit trail.

The metrics that actually matter

Visibility changes which numbers you should track. Test counts and coverage percentages measure activity, not safety. A suite can have ten thousand tests and zero of them aligned to the workflow that just broke.

Two metrics describe the real state of the loop. Time-to-reproduce measures how fast you can turn a production symptom back into a deterministic failure. Escaped-defect rate measures how often a change reaches customers before validation catches it. Both improve directly when change impact is visible, because targeting gets sharper and reproduction draws on incident memory in the graph.

Track these; retire the vanity metrics

Time-to-reproduce: production symptom back to a deterministic failure case.
Escaped-defect rate: defects that reach customers per release, by service.
Targeted-check ratio: checks justified by impact versus checks that always run.
Authoring attribution coverage: share of changes with a known author and tool.
Stop tracking raw test counts and total coverage as proxies for safety.

Why fast and safe stop competing

When every merge carries its own impact map, targeted checks, evidence, and authorship, the reviewer's question "is this safe?" has an answer that takes seconds, not a meeting. Speed is no longer paid for with uncertainty.

This is what a closed reliability loop looks like in practice: Understand, Test, Reproduce, Remediate, Verify, run continuously over the graph. A walkthrough of one such run is in inside a Zof run, and the operating model behind it is described in our guide to autonomous reliability infrastructure.

One published proof point frames the upside without promising it: a Series C fintech VP of Engineering reported 94% fewer production incidents within 90 days. The mechanism was not slowing down. It was seeing change impact clearly enough to keep shipping.

Final takeaway

Velocity is not the variable to control. Visibility is. The teams that ship fast and safely are not the ones that found a magic point on the speed-quality curve; they are the ones that made change impact legible, check by check, change by change.

Build the visibility layer first. With a System Graph underneath, fast becomes the safe default, and the tradeoff your last reorg was built to manage quietly disappears.

Preguntas frecuentes

: No. The System Graph ingests metadata from the tools you already run and Zof integrates with existing CI/CD, Jira, and Slack rather than replacing them. The graph adds the change-to-impact relationships your current pipeline does not model, and pushes evidence and approvals back through the channels your teams already use.

Preparación para la publicación System Graph SRE QA

Guías relacionadas

System Graph for reliability

Producto relacionado

System Graph

Continuar leyendo

Producto

Por qué la fiabilidad del software necesita un System Graph

Los agentes de fiabilidad necesitan contexto. Un System Graph permite una validación enfocada, la puntuación de riesgo y una reproducción de incidentes más rápida.

Equipo de Fiabilidad de Zof7 may 202611 min de lectura