Reliability on The Cloud
The reliability pillar includes the ability of a system to recover from infrastructure or service disruptions, dynamically acquire computing resources to meet demand, and mitigate disruptions such as misconfigurations or transient network issues.
We'll cover the following...
- Design Principles: The five design principles for reliability on the cloud:-
- Test recovery procedures:
- Automatically recover from failure:
- Scale horizontally to increase aggregate system availability:
- Stop guessing capacity:
- Manage change in automation:
- Definition
- Best Practices Foundations
- Change Management
- Failure Management
- Key Services
- Foundations:
- Change Management:
- Failure Management:
The reliability pillar includes the ability of a system to recover from infrastructure or service disruptions, dynamically acquire computing resources to meet demand, and mitigate disruptions such as misconfigurations or transient network issues.