/incident-lifecycle-model

A lifecycle model for describing incident management

Extended Dreyfus Model for Incident Lifecycles

This model describes the life cycle of incidents in the context of large-scale technology infrastructure, beginning at detection and running through response, analysis, remediation, and finally prevention.

It is organized using the Dreyfus Model of skill acquisition. In this way, it is not intended to be prescriptive or metrics-driven, but rather to describe language patterns and behaviors one would observe in oneself or others as they detect, react to, and remediate incidents.

Given this, it also possible that organizations may fall into different descriptive categories within the model, i.e. "'Expert' at 'detection'; and 'beginner' at 'remedation'," etc.

The model is based upon various actual behaviors and language patterns observed in some of the largest technology companies in the world, including many of the "DevOps unicorns."

Pull requests to correct errors or update language and/or behavioral patterns are welcome!

Licensed under a Creative Commons version 4.0 BY-NC-ND license:

  • Attribution - Please attribute the original authors:
  • Noncommercial - Contact @jpaulreed to discuss commercial usage.
  • NoDerivatives - But we accept Pull Requests! (And will add you to the authors list!)