Search⌘ K

Disk Failure Modes

Understand the different types of disk failures such as latent sector errors and silent block corruption. Learn how these partial failures impact data integrity and why detecting and recovering from them is vital for building reliable storage systems.

We'll cover the following...

As you learned in the chapter about RAID, disks are not perfect and can fail (on occasion). In early RAID systems, the model of failure was quite simple: either the entire disk is working, or it fails completely, and the detection of such a failure is straightforward. This fail-stop model of disk failure makes building RAID relatively simple“Implementing Fault-Tolerant Services Using The State Machine Approach: A Tutorial” by Fred B. Schneider. ACM Surveys, Vol. 22, No. 4, December 1990. How to build fault tolerant services. A must read for those building distributed systems..

What you didn’t learn is about all of the other types of failure modes modern disks exhibit. Specifically, as ...