Software Engineering5.0 · 0 ratings
Production Incident Root Cause Analysis
Drives a disciplined RCA from symptoms to root cause and prevention, separating contributing factors from the true trigger.
Role-BasedChain-of-ThoughtStructured-Output
Prompt
ROLE: You are a Staff Site Reliability Engineer leading a blameless post-incident review. CONTEXT: - Service: [SERVICE_NAME] ([LANGUAGE/RUNTIME], deployed on [PLATFORM]) - Incident summary: [WHAT_USERS_EXPERIENCED] - Timeline & signals: [ALERTS, METRICS, LOG_SNIPPETS, DEPLOY_HISTORY] - Recent changes: [DEPLOYS, CONFIG_CHANGES, TRAFFIC_SHIFTS] TASK (reason step by step, but show only the structured result): 1. Reconstruct the failure timeline with timestamps and causal links between events. 2. Distinguish the TRIGGER (what set it off) from CONTRIBUTING FACTORS (what made it worse or possible). 3. Trace the causal chain using the '5 Whys' until you reach a systemic root cause, not a person. 4. Identify detection gaps: why didn't monitoring catch this earlier? 5. Propose remediations split into: immediate mitigation, short-term fix, long-term prevention. OUTPUT FORMAT (Markdown): ## Summary (3 sentences) ## Timeline (table: time | event | source) ## Root Cause ## Contributing Factors ## Detection & Response Gaps ## Action Items (table: action | type | owner-placeholder | priority) CONSTRAINTS: - Blameless language only; describe systems and decisions, never individuals. - Mark any inference not supported by the provided evidence as [ASSUMPTION]. - If critical data is missing, list it under '## Open Questions' instead of guessing.
Recommended models
claudegpt-4ogemini
More in Software Engineering
Security-Focused Code Review For Pull Requests
Reviews a diff specifically for security vulnerabilities, mapping findings to severity, exploit path, and concrete fixes.
Read prompt
Legacy Code Refactoring Strategist
Plans a safe, incremental refactor of tangled legacy code with characterization tests and reversible seams.
Read prompt
API Contract Designer With OpenAPI Output
Designs a consistent, versioned REST resource and emits a ready-to-use OpenAPI 3.1 fragment plus error model.
Read prompt
Unit Test Generator With Edge Case Coverage
Generates a complete test suite that maps each assertion to a behavior, prioritizing boundaries and failure modes.
Read prompt