How to Deal With AWS RDS Maintenance Windows

Amazon RDS is one of the best MySQL-based DBaaS services from Amazon AWS. It provides high availability, resizable capacity, and consistent performance to your applications. To take advantage of the RDS features, we need to design, operate and apply the best practices to RDS to utilize the capability of it to the max extent.

In the last 2 months, we had so many security issues identified in Linux, hypervisors, and MySQL applications that impacted on the Amazon Infrastructure too. To mitigate the security issues, Amazon needs to perform some maintenance activity on the underlying EC2 Instances of RDS MySQL and to patch to the MySQL supported versions. These activities will impact the availability of RDS Instance during the maintenance window.

Amazon is introducing many new features to its existing services to provide top-notch solutions to its customers. Recently, they introduced General Purpose SSD and Provisioned IOPS (SSD) Storage Volumes for RDS instances to deliver fast, predictable, and consistent performance for I/O intensive transactional database workloads. They also introduced other new features like memory-optimized DB Instances, pre-warming InnoDB buffer pool on reboot and even more. Yet again, you need to reboot your RDS instances to take advantage of these new features.
To mitigate the risk of RDS instances unavailability during the maintenance window, some good practices come in handy. Let’s see how to deal with them:

1. Turn on Multi-AZ mode. This is the first and foremost thing to do to improve the availability and enable the built-in automated fail-over from your primary database to a synchronously replicated secondary database in case of a failure or reboot or any maintenance activity.

2. Enable Event Subscriptions to get the notifications of all the Events happening on the RDS instance.

When you subscribe for the Events, it will deliver the Events details to the given Notification Email IDs.

3. Enable CloudWatch metrics on RDS Instances to monitor the replication status between the Master and Read-Replicas. Replication may fail because of changes to the master RDS instance or DB Instance shutdown, so it’s good to have this feature on.

4. Verify the RDS DB Instance reachability, memory and number of DB Connections to understand whether it receives connections or not.

5. In Multi-AZ mode, RDS might take some 30 – 300 seconds to switch to the Fail-Over node, so notify the respective stakeholders on the maintenance activity and approximate downtime.
This is the minimum set of things that you should enable to deal with the AWS RDS maintenance and minimize your downtime. Nevertheless, additional hints and best practices should be deployed

to further increase both the availability and performance of your infrastructure:

  • Read-Replica in Cross-Region: create a read-replica in cross-region to maximize the availability. Whenever primary region outage happens, we can promote the read-replica as the master instance and get the DB instance available all the time.
  • General Purpose (SSD) storage for Consistent Performance: for small and medium database workloads, modify the Storage Type to the General Purpose (SSD) Storage for consistent IOPS delivery for Database operations.
  • Change the Instance type to Current Generation: change the RDS Instance type to Current Generation Instance Types: T2, M3, R3 as per your workload requirements. The newer generation instances will give us the best RAM, CPU, and Networking capabilities compared to the previous generation instance types T1, M1, C1, M2, etc.
  • Tune the MySQL Parameters of RDS as you Scale: When you Scale up and Scale down your RDS instance, there will be many parameters depending on your RDS IOPS, Memory, CPU and networking. Tune them accordingly, otherwise, it will lead to bad performance of the RDS instances.

Avatar

Written by

Praveen Kumar Muppala

I have strong experience on Multiple Unix/Linux flavours, LAMP Stack, Monitoring Systems, Database, NoSQL. I love to explore the new concepts/services in Cloud Computing World. I have written 4 certifications in different flavours of Linux/Unix.


Related Posts

Alisha Reyes
Alisha Reyes
— January 6, 2020

New on Cloud Academy: Red Hat, Agile, OWASP Labs, Amazon SageMaker Lab, Linux Command Line Lab, SQL, Git Labs, Scrum Master, Azure Architects Lab, and Much More

Happy New Year! We hope you're ready to kick your training in overdrive in 2020 because we have a ton of new content for you. Not only do we have a bunch of new courses, hands-on labs, and lab challenges on AWS, Azure, and Google Cloud, but we also have three new courses on Red Hat, th...

Read more
  • agile
  • AWS
  • Azure
  • Google Cloud Platform
  • Linux
  • OWASP
  • programming
  • red hat
  • scrum
Alisha Reyes
Alisha Reyes
— December 24, 2019

Cloud Academy’s Blog Digest: Azure Best Practices, 6 Reasons You Should Get AWS Certified, Google Cloud Certification Prep, and more

Happy Holidays from Cloud Academy We hope you have a wonderful holiday season filled with family, friends, and plenty of food. Here at Cloud Academy, we are thankful for our amazing customer like you.  Since this time of year can be stressful, we’re sharing a few of our latest article...

Read more
  • AWS
  • azure best practices
  • blog digest
  • Cloud Academy
  • Google Cloud
Avatar
Guy Hummel
— December 12, 2019

Google Cloud Platform Certification: Preparation and Prerequisites

Google Cloud Platform (GCP) has evolved from being a niche player to a serious competitor to Amazon Web Services and Microsoft Azure. In 2019, research firm Gartner placed Google in the Leaders quadrant in its Magic Quadrant for Cloud Infrastructure as a Service for the second consecuti...

Read more
  • AWS
  • Azure
  • Google Cloud Platform
Alisha Reyes
Alisha Reyes
— December 10, 2019

New Lab Challenges: Push Your Skills to the Next Level

Build hands-on experience using real accounts on AWS, Azure, Google Cloud Platform, and more Meaningful cloud skills require more than book knowledge. Hands-on experience is required to translate knowledge into real-world results. We see this time and time again in studies about how pe...

Read more
  • AWS
  • Azure
  • Google Cloud
  • hands-on
  • labs
Alisha Reyes
Alisha Reyes
— December 5, 2019

New on Cloud Academy: AWS Solution Architect Lab Challenge, Azure Hands-on Labs, Foundation Certificate in Cyber Security, and Much More

Now that Thanksgiving is over and the craziness of Black Friday has died down, it's now time for the busiest season of the year. Whether you're a last-minute shopper or you already have your shopping done, the holidays bring so much more excitement than any other time of year. Since our...

Read more
  • AWS
  • AWS solution architect
  • AZ-203
  • Azure
  • cyber security
  • FCCS
  • Foundation Certificate in Cyber Security
  • Google Cloud Platform
  • Kubernetes
Avatar
Cloud Academy Team
— December 4, 2019

Understanding Enterprise Cloud Migration

What is enterprise cloud migration? Cloud migration is about moving your data, applications, and even infrastructure from your on-premises computers or infrastructure to a virtual pool of on-demand, shared resources that offer compute, storage, and network services at scale. Why d...

Read more
  • AWS
  • Azure
  • Data Migration
Wendy Dessler
Wendy Dessler
— November 27, 2019

6 Reasons Why You Should Get an AWS Certification This Year

In the past decade, the rise of cloud computing has been undeniable. Businesses of all sizes are moving their infrastructure and applications to the cloud. This is partly because the cloud allows businesses and their employees to access important information from just about anywhere. ...

Read more
  • AWS
  • Certifications
  • certified
Avatar
Andrea Colangelo
— November 26, 2019

AWS Regions and Availability Zones: The Simplest Explanation You Will Ever Find Around

The basics of AWS Regions and Availability Zones We’re going to treat this article as a sort of AWS 101 — it’ll be a quick primer on AWS Regions and Availability Zones that will be useful for understanding the basics of how AWS infrastructure is organized. We’ll define each section,...

Read more
  • AWS
Avatar
Dzenan Dzevlan
— November 20, 2019

Application Load Balancer vs. Classic Load Balancer

What is an Elastic Load Balancer? This post covers basics of what an Elastic Load Balancer is, and two of its examples: Application Load Balancers and Classic Load Balancers. For additional information — including a comparison that explains Network Load Balancers — check out our post o...

Read more
  • ALB
  • Application Load Balancer
  • AWS
  • Elastic Load Balancer
  • ELB
Albert Qian
Albert Qian
— November 13, 2019

Advantages and Disadvantages of Microservices Architecture

What are microservices? Let's start our discussion by setting a foundation of what microservices are. Microservices are a way of breaking large software projects into loosely coupled modules, which communicate with each other through simple Application Programming Interfaces (APIs). ...

Read more
  • AWS
  • Docker
  • Kubernetes
  • Microservices
Nisar Ahmad
Nisar Ahmad
— November 12, 2019

Kubernetes Services: AWS vs. Azure vs. Google Cloud

Kubernetes is a popular open-source container orchestration platform that allows us to deploy and manage multi-container applications at scale. Businesses are rapidly adopting this revolutionary technology to modernize their applications. Cloud service providers — such as Amazon Web Ser...

Read more
  • AWS
  • Azure
  • Google Cloud
  • Kubernetes
Avatar
Stuart Scott
— October 31, 2019

AWS Internet of Things (IoT): The 3 Services You Need to Know

The Internet of Things (IoT) embeds technology into any physical thing to enable never-before-seen levels of connectivity. IoT is revolutionizing industries and creating many new market opportunities. Cloud services play an important role in enabling deployment of IoT solutions that min...

Read more
  • AWS
  • AWS IoT Events
  • AWS IoT SiteWise
  • AWS IoT Things Graph
  • IoT