AWS Cloud Operations Blog
Category: Learning Levels
Maximize cloud investment value through operational excellence using AWS Managed Services
In this blog post, I share my observations as an AMS Solutions Architect on how achieving operational excellence can help organizations realize their cloud business objectives while migrating to AWS. I dive deep into the five design principles that AWS Managed Services (AMS) uses to achieve operational excellence. Amazon is guided by four principles: customer […]
Use AWS Systems Manager Session Manager for port forwarding to Amazon ElastiCache for Redis inside a private subnet
With the increasing adoption of the public cloud, customers must minimize the attack surface of their infrastructure. When it comes to optimizing the response time of read-intensive applications, data caching is one of the first steps to consider. Amazon ElastiCache for Redis is versatile in-memory storage that offers highly available, highly scalable, and extremely fast […]
Scaling Well-Architected reviews with the AWS Well-Architected Tool
The AWS Well-Architected Framework describes key concepts and architectural best practices that help cloud architects build secure, high-performing, resilient, and efficient infrastructure for their applications and workloads in the cloud. An AWS Well-Architected Review includes questions around the AWS Well-Architected Framework that can help application owners ensure that their workloads are following best practices. The […]
Improve monitoring of AWS Systems Manager Agent
The ability to present a single pane of glass simplifies the process of tracking and controlling IT systems. Enterprises that run workloads on AWS use AWS Systems Manager because of its security, ease of management, and centralized reporting. When an agent loses connection to the management platform, you can lose visibility into system behavior and […]
How to easily configure DevOps Guru across your organization with Systems Manager Quick Setup
Amazon DevOps Guru is a machine learning (ML) powered service that helps developers and operators automatically detect anomalies and improve application availability. DevOps Guru uses machine learning models informed by years of Amazon.com and AWS operational excellence to identify anomalous application behavior (e.g., increased latency, error rates, resource constraints) and surface critical issues that […]
Managing and monitoring API throttling in your workloads
When you’re architecting for the cloud, you need to keep API throttling in mind, particularly the types of calls you make and how often you make them. When the allotted rate limit for an API call is exceeded, you’ll receive an error response and the call will be throttled. Excessive API throttling can result in […]
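As a rough illustration of the throttling behavior described in this excerpt, the sketch below retries a throttled call with exponential backoff and jitter. It is not taken from the post itself: `ThrottlingError`, `call_with_backoff`, and all parameter names and default values are hypothetical, and a real workload would catch whatever error its SDK raises when a rate limit is exceeded.

```python
import random
import time


class ThrottlingError(Exception):
    """Hypothetical stand-in for the error a service returns when a rate limit is exceeded."""


def call_with_backoff(operation, max_attempts=5, base_delay=0.1, max_delay=5.0, sleep=time.sleep):
    """Call operation(), retrying on ThrottlingError with exponential backoff and full jitter.

    All defaults are illustrative assumptions, not recommended values.
    """
    for attempt in range(max_attempts):
        try:
            return operation()
        except ThrottlingError:
            # Give up and re-raise once the final attempt has failed.
            if attempt == max_attempts - 1:
                raise
            # The cap doubles on each attempt, bounded by max_delay.
            cap = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter: wait a random time between 0 and the cap so that
            # many throttled clients do not all retry at the same moment.
            sleep(random.uniform(0, cap))
```

The `sleep` parameter exists only so the waiting can be replaced in tests; callers would normally leave it at its default.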
Automate configuration compliance at scale in AWS
AWS Config continuously monitors and records your AWS resource configurations. You can use the service to automate the evaluation and remediation of recorded configurations against desired configurations. You can also review changes in configurations and relationships between AWS resources and dive into the history of a resource configuration. The basis of a well-architected multi-account AWS […]
Developing, versioning, testing, and deploying landing zone changes using CfCT across multiple landing zones
Enterprise customers often ask how they can minimize risk when they’re developing and testing a landing zone configuration. They also want to know how they can promote code between multiple landing zones. AWS Control Tower provides the easiest way to set up and govern a secure, multi-account AWS environment, called a landing zone. Customers who […]
Operational insights in Systems Manager OpsCenter help you identify duplicate issues and noisy event sources
If you use AWS Systems Manager OpsCenter, you might be familiar with the challenges of large numbers of OpsItems. When the same problem causes the creation of a significant number of OpsItems, it can be hard to see that these OpsItems are in fact the result of a single issue. It can also be difficult […]
Cost optimization in AWS using Amazon CloudWatch metric streams, AWS Cost and Usage Reports and Amazon Athena
You can use metric streams to create continuous, near-real-time streams of Amazon CloudWatch metrics to a destination of your choice. Metric streams make it easier to send CloudWatch metrics to popular third-party service providers using an Amazon Kinesis Data Firehose HTTP endpoint. You can create a continuous, scalable stream that includes the most up-to-date CloudWatch […]