Monitoring AWS Infrastructure with CloudWatch

CloudWatch uses the mountains of data constantly being generated by your AWS resources to help you monitor and understand what’s really going on.

Closely monitoring your infrastructure is an integral part of any cloud deployment, and AWS CloudWatch offers a rich set of tools to help. The basic function of any monitoring tool is to collect and help you visualize data so you can take quick and effective action. We should give the same priority to application and server monitoring that we do to High Availability for our applications.

CloudWatch provides infrastructure-level monitoring and, to some extent, application monitoring. You can access CloudWatch from the AWS Console, through the API, or from the command line.

CloudWatch monitors metrics describing the behavior of core AWS services within your account. These metrics show you the state and performance of your AWS infrastructure. Every metric can be made to trigger an alarm, which sends notifications to specified end users through AWS’s Simple Notification Service (SNS).
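To make the metric-to-alarm-to-SNS chain concrete, here is a minimal sketch in Python that only builds the parameters for an alarm on an EC2 instance's CPU utilization. The instance ID, the SNS topic ARN, the 80% threshold, and the helper name `build_cpu_alarm` are all placeholders chosen for illustration, not values from this article.

```python
def build_cpu_alarm(instance_id, topic_arn, threshold=80.0):
    """Return keyword arguments shaped for CloudWatch's PutMetricAlarm call.

    Hypothetical helper: the alarm name pattern and default threshold
    are assumptions made for this example.
    """
    return {
        "AlarmName": f"high-cpu-{instance_id}",
        "Namespace": "AWS/EC2",
        "MetricName": "CPUUtilization",
        "Dimensions": [{"Name": "InstanceId", "Value": instance_id}],
        "Statistic": "Average",
        "Period": 300,  # seconds; lines up with five-minute basic monitoring
        "EvaluationPeriods": 2,  # alarm only after two consecutive breaches
        "Threshold": threshold,
        "ComparisonOperator": "GreaterThanThreshold",
        "AlarmActions": [topic_arn],  # SNS topic that notifies subscribers
    }


# Placeholder identifiers, not real resources.
params = build_cpu_alarm(
    "i-0123456789abcdef0",
    "arn:aws:sns:us-east-1:123456789012:ops-alerts",
)
print(params["AlarmName"])
```

If you have boto3 installed and credentials configured, a dictionary like this could be passed along as `boto3.client("cloudwatch").put_metric_alarm(**params)`. Building the parameters separately keeps the example runnable without an AWS account.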

CloudWatch offers two levels of monitoring: basic (no charge) and detailed. Basic metrics for some services are automatically applied, and seven pre-selected metrics are freely available for EC2 instances should you choose to enable them. Basic monitoring will generally provide checks every five minutes.

Detailed monitoring offers increased checking at a frequency of every minute, and costs $3.50 per instance per month. We will familiarize you with some of CloudWatch’s great features.
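As a rough back-of-the-envelope sketch, the difference between the two levels can be worked out in a few lines of Python. The price constant is simply the figure quoted above and may not match current AWS pricing; the function names are made up for this example.

```python
BASIC_PERIOD_MIN = 5        # basic monitoring: one check every five minutes
DETAILED_PERIOD_MIN = 1     # detailed monitoring: one check every minute
DETAILED_PRICE_USD = 3.50   # per instance per month, as quoted in this article


def datapoints_per_day(period_minutes):
    """Number of datapoints one metric produces in 24 hours."""
    return 24 * 60 // period_minutes


def detailed_monthly_cost(instance_count):
    """Monthly cost in USD of detailed monitoring for a fleet of instances."""
    return instance_count * DETAILED_PRICE_USD


print(datapoints_per_day(BASIC_PERIOD_MIN))     # 288
print(datapoints_per_day(DETAILED_PERIOD_MIN))  # 1440
print(detailed_monthly_cost(10))                # 35.0
```

In other words, detailed monitoring gives you five times as many datapoints per metric, at a cost that scales linearly with the number of instances.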

CloudWatch main features

AWS resources covered by CloudWatch:

  • Amazon EC2
  • EBS Volumes
  • Auto Scaling Groups
  • Elastic Load Balancers
  • Amazon Route 53
  • RDS DB instances
  • DynamoDB tables
  • ElastiCache clusters
  • Redshift clusters
  • SQS queues
  • SNS topics
  • Storage Gateways

CloudWatch features:

CloudWatch – Auto Scaling integration

Auto Scaling lets you automatically scale your servers up and down according to need. You can scale based on schedule, demand, or server hardware utilization. CloudWatch metrics like CPU usage and network utilization can be used to trigger scaling events. For example, suppose your application is running on two instances: you can, say, require one instance to terminate whenever your CPU utilization drops below 60%.
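The scale-in example can be sketched as a small decision function in plain Python. This is only an illustration of the logic, not how Auto Scaling is implemented: the 60% scale-in threshold comes from the example above, while the 80% scale-out threshold, the minimum and maximum group sizes, and the function name `desired_capacity` are assumptions.

```python
def desired_capacity(current, avg_cpu, scale_in_below=60.0,
                     scale_out_above=80.0, minimum=1, maximum=4):
    """Return the instance count a simple CPU-based policy would ask for.

    Hypothetical policy: remove one instance when average CPU falls below
    scale_in_below, add one when it rises above scale_out_above, and never
    leave the [minimum, maximum] range.
    """
    if avg_cpu < scale_in_below:
        return max(minimum, current - 1)
    if avg_cpu > scale_out_above:
        return min(maximum, current + 1)
    return current


print(desired_capacity(2, 45.0))  # 1: CPU below 60%, one instance terminates
print(desired_capacity(1, 45.0))  # 1: already at the minimum size
print(desired_capacity(2, 90.0))  # 3: CPU above the scale-out threshold
```

Keeping a gap between the two thresholds helps stop the group from flapping between sizes when utilization hovers around a single value.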

Reboot failed EC2 instances

We’ve shown how CloudWatch can send you notifications using AWS’s SNS. It can also be told to automatically reboot a failed EC2 instance on a failed status check due to loss of network connectivity, system power, or other software/hardware issues.
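A reboot-on-failure alarm looks much like a notification alarm, except that the alarm action points at an EC2 action instead of an SNS topic. The sketch below only builds the parameters. The metric name `StatusCheckFailed_Instance` and the `arn:aws:automate:<region>:ec2:reboot` action ARN are given as I recall them from the AWS documentation, so check both before relying on them; the instance ID, evaluation settings, and helper name are placeholders.

```python
def build_reboot_alarm(instance_id, region):
    """Return PutMetricAlarm-style parameters that reboot an instance.

    Hypothetical helper: the alarm name pattern, period, and number of
    evaluation periods are assumptions made for this example.
    """
    return {
        "AlarmName": f"reboot-on-failed-check-{instance_id}",
        "Namespace": "AWS/EC2",
        "MetricName": "StatusCheckFailed_Instance",
        "Dimensions": [{"Name": "InstanceId", "Value": instance_id}],
        "Statistic": "Maximum",
        "Period": 60,            # seconds
        "EvaluationPeriods": 3,  # three failed minutes in a row
        "Threshold": 0,          # the metric is 0 when healthy, 1 when failed
        "ComparisonOperator": "GreaterThanThreshold",
        # EC2 action ARN instead of an SNS topic (assumed format, see above).
        "AlarmActions": [f"arn:aws:automate:{region}:ec2:reboot"],
    }


reboot_params = build_reboot_alarm("i-0123456789abcdef0", "us-east-1")
print(reboot_params["AlarmActions"][0])
```

You could add an SNS topic ARN to the same `AlarmActions` list if you also want to be told when a reboot has been triggered.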

Integrate CloudWatch with third-party monitoring and logging tools

You can integrate CloudWatch with third-party monitoring tools like CopperEgg, Stackdriver, and New Relic. These third-party monitoring tools provide very fine-grained performance monitoring, giving you a clear view of the status of your system processes. You can feed CloudWatch metrics to these services, which can then display them on a custom dashboard.

Create Custom Metrics

Besides the default CloudWatch metrics like CPU utilization, network traffic, and disk reads/writes, you may want to monitor more metrics, like memory utilization. You can easily define your own custom metrics. Once these metrics are available in CloudWatch, you can create alarms that trigger new actions.

AWS provides some additional monitoring scripts for adding custom metrics. When you install the scripts you can choose to report any desired combination of the following metrics:

  • Memory Utilization – Memory allocated by applications and the operating system, exclusive of caches and buffers, as a percentage.
  • Memory Used – Memory allocated by applications and the operating system, exclusive of caches and buffers, in megabytes.
  • Memory Available – System memory available for applications and the operating system, in megabytes.
  • Disk Space Utilization – Disk space usage as a percentage.
  • Disk Space Used – Disk space usage in gigabytes.
  • Disk Space Available – Available disk space in gigabytes.
  • Swap Space Utilization – Swap space usage as a percentage.
  • Swap Space Used – Swap space usage in megabytes.
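To illustrate what a custom metric such as Memory Utilization involves, here is a minimal Python sketch that computes the percentage (exclusive of caches and buffers, as defined in the list above) and shapes it as a CloudWatch metric datum. This is not the AWS monitoring script itself: the sample memory figures, the metric name, and both helper names are assumptions made for this example.

```python
def memory_utilization_percent(total_kb, free_kb, buffers_kb, cached_kb):
    """Memory in use by applications and the OS, excluding buffers and caches."""
    used_kb = total_kb - free_kb - buffers_kb - cached_kb
    return round(100.0 * used_kb / total_kb, 2)


def build_metric_datum(instance_id, percent):
    """Shape one datapoint the way CloudWatch's PutMetricData expects it."""
    return {
        "MetricName": "MemoryUtilization",  # hypothetical custom metric name
        "Dimensions": [{"Name": "InstanceId", "Value": instance_id}],
        "Value": percent,
        "Unit": "Percent",
    }


# Made-up sample values in kilobytes, e.g. as read from /proc/meminfo.
percent = memory_utilization_percent(
    total_kb=8_000_000, free_kb=2_000_000,
    buffers_kb=500_000, cached_kb=1_500_000,
)
datum = build_metric_datum("i-0123456789abcdef0", percent)
print(datum["Value"])  # 50.0
```

With boto3 available, a datum like this could be published with `boto3.client("cloudwatch").put_metric_data(Namespace="Custom/System", MetricData=[datum])`, where `Custom/System` is a placeholder namespace. Once the metric exists, you can attach alarms to it just like any built-in metric.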

VPC Flow Logs

Flow Logs have been available on AWS for only a couple of months. You can tell Flow Logs to track all inbound and outbound traffic moving through selected interfaces attached to your VPC. VPC Flow Logs make it much easier to debug issues, such as why you cannot reach a particular instance.

You can also create CloudWatch metrics and alarms tied to network Flow Logs.
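When debugging connectivity, it often helps to pull the rejected traffic out of the flow log records. The Python sketch below parses space-separated records and filters on the action field. The field order follows the default flow log record format as I understand it (version, account, interface, addresses, ports, protocol, packets, bytes, start, end, action, log status); treat that layout, and the sample records, as assumptions to check against your own logs.

```python
# Assumed default field order for a flow log record.
FIELDS = [
    "version", "account_id", "interface_id", "srcaddr", "dstaddr",
    "srcport", "dstport", "protocol", "packets", "bytes",
    "start", "end", "action", "log_status",
]


def parse_flow_record(line):
    """Split one space-separated flow log record into a field dictionary."""
    values = line.split()
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(values)}")
    return dict(zip(FIELDS, values))


# Made-up sample records for illustration.
sample_lines = [
    "2 123456789010 eni-1235b8ca123456789 172.31.16.139 172.31.16.21 "
    "20641 22 6 20 4249 1418530010 1418530070 ACCEPT OK",
    "2 123456789010 eni-1235b8ca123456789 172.31.9.69 172.31.9.12 "
    "49761 3389 6 20 4249 1418530010 1418530070 REJECT OK",
]

records = [parse_flow_record(line) for line in sample_lines]
rejected = [r for r in records if r["action"] == "REJECT"]

for r in rejected:
    print(r["srcaddr"], "->", r["dstaddr"], "port", r["dstport"])
```

A rejected record for the port you are trying to reach is a strong hint that a security group or network ACL rule is blocking the traffic.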

If you’re interested in learning more about Amazon CloudWatch, Cloud Academy’s Getting Started to CloudWatch Course is your go-to course.

Written by

Nitheesh Poojary

My professional IT career began nine years ago, when I was just out of college. I worked with a great team as an infrastructure management engineer, managing hundreds of enterprise application servers. I found my passion when I got the opportunity to work with Cloud technologies: I'm addicted to AWS Cloud Services, DevOps engineering, and all the cloud tools and technologies that make engineers' lives easier. Currently, I am working as a Solution Architect at SixNines IT. We are an experienced team of engineers who have helped hundreds of customers move to the cloud responsibly. I have earned 5 AWS certifications, and I happily help fellow engineers across the globe through my blogs and by answering questions in various forums.

