DevOps Runbooks Skill
Creates actionable runbooks for operational procedures, incident response, and system maintenance.
What This Skill Does
- Creates operational runbooks
- Documents incident procedures
- Defines escalation paths
- Provides troubleshooting guides
- Documents rollback procedures
- Captures operational knowledge
When to Use
- Incident response planning
- On-call documentation
- System maintenance procedures
- Disaster recovery planning
- Knowledge transfer
Reference Files
- references/RUNBOOK.template.md - Comprehensive operational runbook format
Runbook Structure
- Overview - Purpose and when to use
- Prerequisites - Access and tools needed
- Quick Reference - Key commands and URLs
- Procedure - Step-by-step with verification
- Rollback - How to revert changes
- Troubleshooting - Common issues
- Escalation - When and how to escalate
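The seven sections listed above can be turned into an empty skeleton that an author then fills in. The following is a minimal sketch, not the contents of references/RUNBOOK.template.md: the function name `render_skeleton`, the `TODO:` placeholder convention, and the example title are all assumptions made for illustration.

```python
# Section names and hints come from the Runbook Structure list;
# everything else (function name, TODO markers, layout) is illustrative.
SECTIONS = [
    ("Overview", "Purpose and when to use"),
    ("Prerequisites", "Access and tools needed"),
    ("Quick Reference", "Key commands and URLs"),
    ("Procedure", "Step-by-step with verification"),
    ("Rollback", "How to revert changes"),
    ("Troubleshooting", "Common issues"),
    ("Escalation", "When and how to escalate"),
]


def render_skeleton(title: str) -> str:
    """Return a plain-text runbook skeleton with one block per section."""
    lines = [title, ""]
    for name, hint in SECTIONS:
        lines.append(name)               # section heading
        lines.append(f"TODO: {hint}")    # placeholder for the author to replace
        lines.append("")                 # blank line between sections
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical runbook title, used only to show the output shape.
    print(render_skeleton("Example service restart"))
```

Keeping the section order in a single list means a new runbook cannot silently omit Rollback or Escalation, which are the sections most often skipped under time pressure.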
Best Practices
- Commands must be copy-pasteable
- Include expected output for each step
- Document decision points clearly
- Define rollback at each step
- Keep procedures current (test regularly)
- Include escalation contacts
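Several of these practices can be checked mechanically before a runbook is published. The sketch below assumes a hypothetical `Step` record with a command, an expected output, and a rollback, and flags any step that is missing one of them. The field names, the `lint_steps` helper, and the example commands and service name are illustrative assumptions, not part of this skill.

```python
from dataclasses import dataclass


@dataclass
class Step:
    """One procedure step (hypothetical structure for illustration)."""
    description: str
    command: str
    expected_output: str = ""
    rollback: str = ""


def lint_steps(steps: list[Step]) -> list[str]:
    """Return a message for every step missing a command, expected output, or rollback."""
    problems = []
    for number, step in enumerate(steps, start=1):
        if not step.command.strip():
            problems.append(f"step {number}: missing command")
        if not step.expected_output.strip():
            problems.append(f"step {number}: missing expected output")
        if not step.rollback.strip():
            problems.append(f"step {number}: missing rollback")
    return problems


if __name__ == "__main__":
    # Example steps use a made-up service name; adapt to the real system.
    steps = [
        Step(
            description="Stop the service",
            command="systemctl stop example.service",
            expected_output="No output; exit status 0",
            rollback="systemctl start example.service",
        ),
        Step(
            description="Apply the new configuration",
            command="cp example.conf.new /etc/example/example.conf",
            expected_output="No output; exit status 0",
            # rollback intentionally omitted so the lint reports it
        ),
    ]
    for problem in lint_steps(steps):
        print(problem)
```

A check like this only verifies that the fields are present. Whether the commands are truly copy-pasteable and the rollback actually works still has to be confirmed by testing the procedure regularly.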