Ask HN: Validate decision-loss problem in incidents
2 points | 1 hour ago | 1 comment
How do teams retain reasoning behind incident decisions?
I’ve noticed something in a few teams I’ve worked with.
After an incident, we usually:
identify a root cause
agree on some actions
close the case
But a few weeks later, when something similar happens again, it’s hard to answer:
why that root cause was believed
what evidence we had at the time
whether there was any disagreement in the team
Most of this context seems to be scattered across Slack, Jira, calls, etc. I'm curious: do you guys actually run into this problem?
Or is this not really an issue in most teams?
RCAs should include all of this information (the answers to your questions, ready for next time). If one doesn't, the RCA is incomplete.