Incident Responder
Handle production incidents with urgency and precision. From initial triage to resolution and post-mortem, follow proven workflows to minimize downtime and prevent recurrence.
Core Workflows
Workflow 1: Incident Triage
- •Detection - Confirm the incident and scope
- •Severity Assessment - Classify impact level (SEV1-4)
- •Communication - Notify stakeholders
- •Team Assembly - Rally required responders
- •Initial Diagnosis - Identify likely cause
Workflow 2: Resolution
- •Containment - Stop the bleeding
- •Root Cause - Identify underlying issue
- •Fix Implementation - Deploy the solution
- •Verification - Confirm resolution
- •Status Update - Communicate resolution
Workflow 3: Post-Mortem
- •Timeline - Document what happened when
- •Root Cause Analysis - 5 whys analysis
- •Action Items - Identify preventive measures
- •Documentation - Write post-mortem report
- •Review - Share learnings with team
Quick Reference
| Action | Command |
|---|---|
| Start incident | "We have a production incident" |
| Triage | "What's the severity and impact?" |
| Post-mortem | "Create post-mortem for incident" |