Monitoring AWS Infrastructure with CloudWatch

CloudWatch uses the mountains of data constantly being generated by your AWS resources to help you monitor and understand what’s really going on.

Closely monitoring your infrastructure is an integral part of any cloud deployment, and AWS CloudWatch offers a rich set of tools to help. The basic function of any monitoring tool is to collect and help you visualize data so you can take quick and effective action. We should give the same priority to application and server monitoring that we do to High Availability for our applications.

CloudWatch provides infrastructure level monitoring and, to some extent, application monitoring. You can access CloudWatch either from the AWS Console or through API and the command line.

CloudWatch monitors metrics describing the behavior of core AWS services within your account. These metrics give you the state of your AWS infrastructure and performance. Every Metric can be made to trigger an alarm, which sends notifications to specified end users through AWS’s Simple Notifications Service (SNS).

Cloudwatch offers two levels of monitoring: basic (no charge) and detailed. Basic metrics for some services are automatically applied, and seven pre-selected metrics are freely available for EC2 instances should you choose to enable them. Basic monitoring will generally provide checks every five minutes.

Detailed monitoring offers increased checking at a frequency of every minute, and costs $3.50 per instance per month. We will familiarize you with some of CloudWatch’s great features.

Cloudwatch main features

AWS resources covered by CloudWatch:

  • Amazon Ec2
  • EBS Volumes
  • AutoScaling Groups
  • Elastic Load Balancers
  • Amazon Route 53
  • RDS DB instances
  • DynamoDB tables
  • ElastiCache clusters
  • RedShift clusters
  • SQS queues
  • SNS topics
  • Storage Gateways
Cloudwatch-Enabled Services

CloudWatch features:

CloudWatch – Auto Scaling integration

Auto Scaling lets you automatically scale your servers up and down according to need. You can scale based on schedule, demand, or server hardware utilization. CloudWatch metrics like CPU usage and network utilization can be used to trigger scaling events. For example, suppose your application is running on two instances: you can, say, require one instance to terminate whenever your CPU utilization drops below 60%.

Reboot failed EC2 instances

We’ve shown how CloudWatch can send you notifications using AWS’s SNS. It can also be told to automatically reboot a failed EC2 instance on a failed status check due to loss of network connectivity, system power, or other software/hardware issues.

Integrate CloudWatch with third-party monitoring and logging tools

You can integrate CloudWatch with third-party monitoring tools like Copperegg, stackdriver, and New Relic. These third-party monitoring tools provide very fine performance monitoring, giving you a clear view of the status of your system processes. You can feed CloudWatch metrics to these services when can then be displayed on a custom dashboard.

Create Custom Metrics

Besides the default CloudWatch metrics like CPU Utilization, Network traffic, and disk read/writes, you may want to monitor more metrics, like memory utilization. You can easily define your own custom metrics. Once these metrics are available in CloudWatch, you can create alarms that trigger new actions.

AWS provides some additional monitoring scripts for adding custom metrics. When you install the scripts you can choose to report any desired combination of the following metrics:

  • Memory Utilization – Memory allocated by applications and the operating system, exclusive of caches and buffers, in percentages.
  • Memory Used – Memory allocated by applications and the operating system, exclusive of caches and buffers, in megabytes.
  • Memory Available – System memory available for applications and the operating system, in megabytes.
  • Disk Space Utilization – Disk space usage as percentages.
  • Disk Space Used – Disk space usage in gigabytes.
  • Disk Space Available – Available disk space in gigabytes.
  • Swap Space Utilization – Swap space usage as a percentage.
  • Swap Space Used – Swap space usage in megabytes.

VPC Flow Logs

Flow Logs have been available on AWS for only a couple of months. You can tell Flow Logs to track all inbound and outbound traffic moving through selected interfaces attached to your VPC. VPC flow Logs make it much easier to debug issues, like why you are not able to reach a particular instance.

You can also create CloudWatch metrics and alarms tied to network Flow Logs.

If you’re interested to learn more about Amazon CloudWatch, the Cloud Academy’s Getting Started to CloudWatch Course is your go-to course. Watch this short video taken from the course.

Avatar

Written by

Nitheesh Poojary

My professional IT career began nine years back when I was just out of my college. I worked with a great team as an infrastructure management engineer, managing hundreds of enterprise application servers. I found my passion when I got the opportunity to work with Cloud technologies: I'm addicted to AWS Cloud Services, DevOps engineering, and all the cloud tools and technologies that make engineers' lives easier. Currently, I am working as a Solution Architect in SixNines IT. We are an experienced team of engineers that have helped hundreds of customers move to the cloud responsibly. I have achieved 5 AWS certifications, happily helping fellow engineers across the globe through my blogs and answering questions in various forums.


Related Posts

Alisha Reyes
Alisha Reyes
— January 6, 2020

New on Cloud Academy: Red Hat, Agile, OWASP Labs, Amazon SageMaker Lab, Linux Command Line Lab, SQL, Git Labs, Scrum Master, Azure Architects Lab, and Much More

Happy New Year! We hope you're ready to kick your training in overdrive in 2020 because we have a ton of new content for you. Not only do we have a bunch of new courses, hands-on labs, and lab challenges on AWS, Azure, and Google Cloud, but we also have three new courses on Red Hat, th...

Read more
  • agile
  • AWS
  • Azure
  • Google Cloud Platform
  • Linux
  • OWASP
  • programming
  • red hat
  • scrum
Alisha Reyes
Alisha Reyes
— December 24, 2019

Cloud Academy’s Blog Digest: Azure Best Practices, 6 Reasons You Should Get AWS Certified, Google Cloud Certification Prep, and more

Happy Holidays from Cloud Academy We hope you have a wonderful holiday season filled with family, friends, and plenty of food. Here at Cloud Academy, we are thankful for our amazing customer like you.  Since this time of year can be stressful, we’re sharing a few of our latest article...

Read more
  • AWS
  • azure best practices
  • blog digest
  • Cloud Academy
  • Google Cloud
Avatar
Guy Hummel
— December 12, 2019

Google Cloud Platform Certification: Preparation and Prerequisites

Google Cloud Platform (GCP) has evolved from being a niche player to a serious competitor to Amazon Web Services and Microsoft Azure. In 2019, research firm Gartner placed Google in the Leaders quadrant in its Magic Quadrant for Cloud Infrastructure as a Service for the second consecuti...

Read more
  • AWS
  • Azure
  • Google Cloud Platform
Alisha Reyes
Alisha Reyes
— December 10, 2019

New Lab Challenges: Push Your Skills to the Next Level

Build hands-on experience using real accounts on AWS, Azure, Google Cloud Platform, and more Meaningful cloud skills require more than book knowledge. Hands-on experience is required to translate knowledge into real-world results. We see this time and time again in studies about how pe...

Read more
  • AWS
  • Azure
  • Google Cloud
  • hands-on
  • labs
Alisha Reyes
Alisha Reyes
— December 5, 2019

New on Cloud Academy: AWS Solution Architect Lab Challenge, Azure Hands-on Labs, Foundation Certificate in Cyber Security, and Much More

Now that Thanksgiving is over and the craziness of Black Friday has died down, it's now time for the busiest season of the year. Whether you're a last-minute shopper or you already have your shopping done, the holidays bring so much more excitement than any other time of year. Since our...

Read more
  • AWS
  • AWS solution architect
  • AZ-203
  • Azure
  • cyber security
  • FCCS
  • Foundation Certificate in Cyber Security
  • Google Cloud Platform
  • Kubernetes
Avatar
Cloud Academy Team
— December 4, 2019

Understanding Enterprise Cloud Migration

What is enterprise cloud migration? Cloud migration is about moving your data, applications, and even infrastructure from your on-premises computers or infrastructure to a virtual pool of on-demand, shared resources that offer compute, storage, and network services at scale. Why d...

Read more
  • AWS
  • Azure
  • Data Migration
Wendy Dessler
Wendy Dessler
— November 27, 2019

6 Reasons Why You Should Get an AWS Certification This Year

In the past decade, the rise of cloud computing has been undeniable. Businesses of all sizes are moving their infrastructure and applications to the cloud. This is partly because the cloud allows businesses and their employees to access important information from just about anywhere. ...

Read more
  • AWS
  • Certifications
  • certified
Avatar
Andrea Colangelo
— November 26, 2019

AWS Regions and Availability Zones: The Simplest Explanation You Will Ever Find Around

The basics of AWS Regions and Availability Zones We’re going to treat this article as a sort of AWS 101 — it’ll be a quick primer on AWS Regions and Availability Zones that will be useful for understanding the basics of how AWS infrastructure is organized. We’ll define each section,...

Read more
  • AWS
Avatar
Dzenan Dzevlan
— November 20, 2019

Application Load Balancer vs. Classic Load Balancer

What is an Elastic Load Balancer? This post covers basics of what an Elastic Load Balancer is, and two of its examples: Application Load Balancers and Classic Load Balancers. For additional information — including a comparison that explains Network Load Balancers — check out our post o...

Read more
  • ALB
  • Application Load Balancer
  • AWS
  • Elastic Load Balancer
  • ELB
Albert Qian
Albert Qian
— November 13, 2019

Advantages and Disadvantages of Microservices Architecture

What are microservices? Let's start our discussion by setting a foundation of what microservices are. Microservices are a way of breaking large software projects into loosely coupled modules, which communicate with each other through simple Application Programming Interfaces (APIs). ...

Read more
  • AWS
  • Docker
  • Kubernetes
  • Microservices
Nisar Ahmad
Nisar Ahmad
— November 12, 2019

Kubernetes Services: AWS vs. Azure vs. Google Cloud

Kubernetes is a popular open-source container orchestration platform that allows us to deploy and manage multi-container applications at scale. Businesses are rapidly adopting this revolutionary technology to modernize their applications. Cloud service providers — such as Amazon Web Ser...

Read more
  • AWS
  • Azure
  • Google Cloud
  • Kubernetes
Avatar
Stuart Scott
— October 31, 2019

AWS Internet of Things (IoT): The 3 Services You Need to Know

The Internet of Things (IoT) embeds technology into any physical thing to enable never-before-seen levels of connectivity. IoT is revolutionizing industries and creating many new market opportunities. Cloud services play an important role in enabling deployment of IoT solutions that min...

Read more
  • AWS
  • AWS IoT Events
  • AWS IoT SiteWise
  • AWS IoT Things Graph
  • IoT