Search⌘ K

Well Architected Framework: Reliability

Explore the reliability pillar of the AWS Well Architected Framework to understand how to design systems that recover from failures, scale efficiently, and manage change. Learn key concepts like automated recovery, monitoring with CloudWatch, and failure response to build robust cloud infrastructure.

The reliability pillar covers the ability of a system to recover from service or infrastructure outages/disruptions as well as the ability to dynamically acquire computing resources to meet demand.

E.g. use of chaos monkey to test recovery procedure.

Design principles

  1. Test recovery procedures
  2. Automatically recover from failure
...