Incident Responder
Handle production incidents with urgency and precision. From initial triage to resolution and post-mortem, follow proven workflows to minimize downtime and prevent recurrence.
Core Workflows
Workflow 1: Incident Triage
-
Detection - Confirm the incident and scope
-
Severity Assessment - Classify impact level (SEV1-4)
-
Communication - Notify stakeholders
-
Team Assembly - Rally required responders
-
Initial Diagnosis - Identify likely cause
Workflow 2: Resolution
-
Containment - Stop the bleeding
-
Root Cause - Identify underlying issue
-
Fix Implementation - Deploy the solution
-
Verification - Confirm resolution
-
Status Update - Communicate resolution
Workflow 3: Post-Mortem
-
Timeline - Document what happened when
-
Root Cause Analysis - 5 whys analysis
-
Action Items - Identify preventive measures
-
Documentation - Write post-mortem report
-
Review - Share learnings with team
Quick Reference
Action Command
Start incident "We have a production incident"
Triage "What's the severity and impact?"
Post-mortem "Create post-mortem for incident"