Operational Excellence — Best Practices
Design Principles
- Perform operations as code (IaC)
- Make frequent, small, reversible changes
- Refine operations procedures frequently
- Anticipate failure
- Learn from all operational failures
Key AWS Services
- CloudFormation — IaC
- Config — compliance
- CloudWatch — monitoring
- CloudTrail — audit
- X-Ray — distributed tracing
- Systems Manager — operations management