Search⌘ K
AI Features

Stopping Crack Propagation

Explore how to stop failure cracks from propagating across distributed systems by examining timeout configurations, service partitioning, and architectural patterns like request/reply. Understand practical measures to maintain system stability and prevent cascading failures in real-world scenarios.

Failure modes of the airline incident

Let’s see how the design of failure modes applies to the grounded airline from before. The airline’s Core Facilities project had not planned out its failure modes. The crack started at the improper handling of the SQLException, but it could have been stopped at many other points. Let’s ...