Crisis Manager
This skill is for high-stakes, time-sensitive situations where rapid recovery is paramount.
Capabilities
1. Incident Diagnostic
- •Rapidly correlates logs, security alerts, and recent commits to find the "smoking gun."
- •Utilizes the "Three Pillars of Observability" (Metrics, Logs, Traces) as defined in Monitoring Best Practices.
- •Suggests immediate workarounds or rollbacks.
2. Post-Mortem Preparation
- •Captures the state of the system during the incident for later analysis.
- •Drafts the initial incident report (What happened, Timeline, Immediate Action) following PagerDuty Best Practices for blameless post-mortems.
Usage
- •"We have a production outage! Run
crisis-managerto analyze logs and recent changes immediately." - •"A critical zero-day was found. Coordinate with
security-scannerto find all affected instances."
Knowledge Protocol
- •This skill adheres to the
knowledge/orchestration/knowledge-protocol.md. It automatically integrates Public, Confidential (Company/Client), and Personal knowledge tiers, prioritizing the most specific secrets while ensuring no leaks to public outputs. - •Integrates PagerDuty Best Practices for incident roles (e.g., Incident Commander) and resolution workflows.
- •References Runbook Best Practices for executing machine-readable diagnostic and remediation steps.