AWS Operational Excellence Pillar

Last Updated : 20-Dec-2020

Design Principles

  • Perform operations as code: script all operational procedures to limit human error and enable consistent responses to events.
  • Make frequent, small, reversible changes: design workloads so that updates can be applied incrementally and reversed easily if they fail.
  • Refine operations procedures frequently: learn and improve continuously.
  • Anticipate failure: test your failure scenarios and response procedures using game days.
  • Learn from all operational failures: actively draw lessons from all operational events and failures.
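
The first two principles and the game-day idea can be illustrated with a minimal sketch. It is not an AWS API: the `Step` class, the `run_runbook` function, and the in-memory `state` dictionary are all hypothetical names invented for this example. It assumes each operational change can be paired with a rollback action, and it simulates a failure on purpose, the way a game-day drill might.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Step:
    """One small, reversible operational change (hypothetical structure)."""
    name: str
    apply: Callable[[], None]
    rollback: Callable[[], None]


def run_runbook(steps: List[Step]) -> List[str]:
    """Apply steps in order; on failure, roll back completed steps in reverse."""
    log: List[str] = []
    done: List[Step] = []
    for step in steps:
        try:
            step.apply()
            done.append(step)
            log.append(f"applied {step.name}")
        except Exception as exc:
            log.append(f"failed {step.name}: {exc}")
            for prev in reversed(done):
                prev.rollback()
                log.append(f"rolled back {prev.name}")
            break
    return log


# Stand-in for real infrastructure state (assumption for this sketch).
state = {"capacity": 2}


def scale_up() -> None:
    state["capacity"] += 1


def scale_down() -> None:
    state["capacity"] -= 1


def broken_deploy() -> None:
    # Game-day style drill: this step fails on purpose.
    raise RuntimeError("simulated failure")


drill_log = run_runbook([
    Step("scale-up", scale_up, scale_down),
    Step("deploy", broken_deploy, lambda: None),
])
```

Because every step carries its own rollback, the failed drill leaves `state` exactly as it started, and `drill_log` records what happened so the event can be reviewed afterwards.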

Best Practices

  • Organization – understand business priorities
  • Prepare – build the solution with metrics to measure success, and create a pipeline for build, configuration, testing, and deployment
  • Operate – monitor metrics for both the applications and the operations processes
  • Evolve – regularly evaluate, prioritize, and deliver on opportunities for improvement
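
As a rough illustration of "metrics to measure success" for the operations process itself, the sketch below computes two commonly used figures from a list of deployment records. The record layout, the sample values, and the function names are assumptions made up for this example; the source does not prescribe any particular metrics.

```python
from statistics import mean

# Hypothetical deployment records: (succeeded, minutes_to_recover_if_failed)
deployments = [
    (True, 0),
    (False, 30),
    (True, 0),
    (False, 10),
]


def change_failure_rate(records):
    """Fraction of deployments that failed."""
    failures = sum(1 for ok, _ in records if not ok)
    return failures / len(records)


def mean_time_to_recover(records):
    """Average recovery time in minutes across failed deployments."""
    times = [minutes for ok, minutes in records if not ok]
    return mean(times) if times else 0.0


failure_rate = change_failure_rate(deployments)
mttr_minutes = mean_time_to_recover(deployments)
```

Tracking figures like these over time gives the Evolve step something concrete to evaluate when prioritizing improvements.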


Management & Governance
• AWS AppConfig
• Auto Scaling
• AWS Backint Agent for SAP HANA
• AWS Chatbot
• AWS CloudFormation
• AWS CloudTrail
• Amazon CloudWatch
• AWS Command Line Interface (AWS CLI)
• AWS Compute Optimizer
• AWS Config
• AWS Console Mobile Application
• AWS Control Tower
• Amazon Data Lifecycle Manager
• AWS Health
• AWS License Manager
• Amazon Managed Service for Grafana (AMG)
• Amazon Managed Service for Prometheus (AMP)
• AWS Management Console
• AWS OpsWorks
• AWS Organizations
• AWS Proton
• AWS Service Catalog
• Service Quotas
• AWS Systems Manager
• AWS Tools for PowerShell
• AWS Trusted Advisor
• AWS Well-Architected Tool
