kaizen:why

Five Whys Analysis

Apply Five Whys root cause analysis to investigate issues by iteratively asking "why" to drill from symptoms to root causes.

Description

Iteratively ask "why" to move from surface symptoms to fundamental causes. Identifies systemic issues rather than quick fixes.

Usage

/why [issue_description]

Variables

ISSUE: Problem or symptom to analyze (default: prompt for input)
DEPTH: Number of "why" iterations (default: 5, adjust as needed)

Steps

State the problem clearly
Ask "Why did this happen?" and document the answer
For that answer, ask "Why?" again
Continue until reaching root cause (usually 5 iterations)
Validate by working backwards: root cause → symptom
Explore branches if multiple causes emerge
Propose solutions addressing root causes, not symptoms

Examples

Example 1: Production Bug

Problem: Users see 500 error on checkout Why 1: Payment service throws exception Why 2: Request timeout after 30 seconds Why 3: Database query takes 45 seconds Why 4: Missing index on transactions table Why 5: Index creation wasn't in migration scripts Root Cause: Migration review process doesn't check query performance

Solution: Add query performance checks to migration PR template

Example 2: CI/CD Pipeline Failures

Problem: E2E tests fail intermittently Why 1: Race condition in async test setup Why 2: Test doesn't wait for database seed completion Why 3: Seed function doesn't return promise Why 4: TypeScript didn't catch missing return type Why 5: strict mode not enabled in test config Root Cause: Inconsistent TypeScript config between src and tests

Solution: Unify TypeScript config, enable strict mode everywhere

Example 3: Multi-Branch Analysis

Problem: Feature deployment takes 2 hours

Branch A (Build): Why 1: Docker build takes 90 minutes Why 2: No layer caching Why 3: Dependencies reinstalled every time Why 4: Cache invalidated by timestamp in Dockerfile Root Cause A: Dockerfile uses current timestamp for versioning

Branch B (Tests): Why 1: Test suite takes 30 minutes Why 2: Integration tests run sequentially Why 3: Test runner config has maxWorkers: 1 Why 4: Previous developer disabled parallelism due to flaky tests Root Cause B: Flaky tests masked by disabling parallelism

Solutions: A) Remove timestamp from Dockerfile, use git SHA B) Fix flaky tests, re-enable parallel test execution

Notes

Don't stop at symptoms; keep digging for systemic issues
Multiple root causes may exist - explore different branches
Document each "why" for future reference
Consider both technical and process-related causes
The magic isn't in exactly 5 whys - stop when you reach the true root cause
Stop when you hit systemic/process issues, not just technical details
Multiple root causes are common—explore branches separately
If "human error" appears, keep digging: why was error possible?
Document every "why" for future reference
Root cause usually involves: missing validation, missing docs, unclear process, or missing automation
Test solutions: implement → verify symptom resolved → monitor for recurrence

Safety Notice

Copy this and send it to your AI assistant to learn

Source Transparency

Related Skills

sdd:plan

sdd:implement

customaize-agent:prompt-engineering

code-review:review-local-changes