Tutorial

Manage autonomous reliability in Console

SRE and EM workflow spanning runs, test health, releases, and remediation.

Overview

SRE and EM workflow spanning runs, test health, releases, and remediation.

Tutorial details

Audience
Engineering manager / SRE
Duration
45 min
Prerequisites
Existing project with run history

Tutorial steps

Review reliability posture

Home metrics and Reports overview.

Navigation: Console Home quick actions or the project wizard for new initiatives. Use ⌘K / Ctrl+K to jump to any surface.

Verification: Confirm organization and team context in the Console header before making changes.

Analyze Test Health

Identify flakiness and failure clusters.

Navigation: Console Home quick actions or the project wizard for new initiatives. Use ⌘K / Ctrl+K to jump to any surface.

Verification: Note project, run, or agent IDs if you may need support escalation.

Evaluate release gates

Releases → gate status before ship.

Navigation: Console Home quick actions or the project wizard for new initiatives. Use ⌘K / Ctrl+K to jump to any surface.

Verification: Confirm UI state matches your runbook. Retry once on transient errors before opening a ticket.

Review remediation queue

Remediation → approvals if policies enabled.

Navigation: Console Home quick actions or the project wizard for new initiatives. Use ⌘K / Ctrl+K to jump to any surface.

Verification: Confirm UI state matches your runbook. Retry once on transient errors before opening a ticket.

Expected outcome

Operational reliability workflow understood across Console areas.

After completing this tutorial

  • Capture run IDs and screenshots for your team runbook
  • Share learnings with QA, SRE, or platform stakeholders
  • Proceed to related how-to guides for operational hardening

Continue learning

Was this page helpful?

Manage autonomous reliability in Console | Zof AI Documentation