5.4.4. Problem, Error, and Known Error
💡 First Principle: This is the most precisely-defined chain in the value-chain vocabulary — disaster, problem, error, known error — and the exam tests whether you can distinguish a cause (problem), a flaw (error), and a cause whose nature is understood (known error).
The terms, from broad to specific: a disaster is a sudden, major event causing serious disruption requiring significant recovery effort; a problem is a cause, or potential cause, of one or more incidents; an error is a flaw or vulnerability that may cause incidents; a known error is a problem that has been analysed and whose cause is understood (often with a documented workaround) but not yet permanently resolved.
The relationship: incidents are the symptoms you experience; a problem is the underlying cause; once you've analysed that problem and understand it, it becomes a known error. Understanding the cause (known error) lets you manage incidents better even before a permanent fix lands.
⚠️ Exam Trap: Distinguish carefully: a problem is a cause (possibly not yet understood); a known error is a problem whose cause is understood. The transition from problem to known error is analysis and understanding, not resolution — a known error can still be unresolved.
Reflection Question: A recurring outage's cause has been identified and documented with a workaround, but not permanently fixed. Is that a problem or a known error, and why?