Skip to main content

Operational Excellence — Best Practices

Design Principles

  • Perform operations as code (IaC)
  • Make frequent, small, reversible changes
  • Refine operations procedures frequently
  • Anticipate failure
  • Learn from all operational failures

Key AWS Services

  • CloudFormation — IaC
  • Config — compliance
  • CloudWatch — monitoring
  • CloudTrail — audit
  • X-Ray — distributed tracing
  • Systems Manager — operations management