Well Architected Framework: Reliability
Get introduced to the reliability design principles.
We'll cover the following
The reliability pillar covers the ability of a system to recover from service or infrastructure outages/disruptions as well as the ability to dynamically acquire computing resources to meet demand.
E.g. use of chaos monkey to test recovery procedure.
Design principles
- Test recovery procedures
- Automatically recover from failure
- Scale horizontally to increase aggregate system availability
- Stop estimating/guesting capacity. E.g. No under-provisioning or over-provisioning.
Reliability on the cloud
Reliability on the cloud consists of three areas:
- Foundations
- Change Management
- Failure Management
Get hands-on with 1400+ tech skills courses.