Skip to main content

How to Deal With AWS RDS Maintenance Windows

Amazon RDS is one of the best MySQL-based DBaaS services from Amazon AWS. It provides high availability, resizable capacity, and consistent performance to your applications. To take advantage of the RDS features, we need to design, operate and apply the best practices to RDS to utilize the capability of it to the max extent.

In the last 2 months, we had so many security issues identified in Linux, hypervisors, and MySQL applications that impacted on the Amazon Infrastructure too. To mitigate the security issues, Amazon needs to perform some maintenance activity on the underlying EC2 Instances of RDS MySQL and to patch to the MySQL supported versions. These activities will impact the availability of RDS Instance during the maintenance window.

Amazon is introducing many new features to its existing services to provide top-notch solutions to its customers. Recently, they introduced General Purpose SSD and Provisioned IOPS (SSD) Storage Volumes for RDS instances to deliver fast, predictable, and consistent performance for I/O intensive transactional database workloads. They also introduced other new features like memory-optimized DB Instances, pre-warming InnoDB buffer pool on reboot and even more. Yet again, you need to reboot your RDS instances to take advantage of these new features.
To mitigate the risk of RDS instances unavailability during the maintenance window, some good practices come in handy. Let’s see how to deal with them:

1. Turn on Multi-AZ mode. This is the first and foremost thing to do to improve the availability and enable the built-in automated fail-over from your primary database to a synchronously replicated secondary database in case of a failure or reboot or any maintenance activity.

2. Enable Event Subscriptions to get the notifications of all the Events happening on the RDS instance.

When you subscribe for the Events, it will deliver the Events details to the given Notification Email IDs.

3. Enable CloudWatch metrics on RDS Instances to monitor the replication status between the Master and Read-Replicas. Replication may fail because of changes to the master RDS instance or DB Instance shutdown, so it’s good to have this feature on.

4. Verify the RDS DB Instance reachability, memory and number of DB Connections to understand whether it receives connections or not.

5. In Multi-AZ mode, RDS might take some 30 – 300 seconds to switch to the Fail-Over node, so notify the respective stakeholders on the maintenance activity and approximate downtime.
This is the minimum set of things that you should enable to deal with the AWS RDS maintenance and minimize your downtime. Nevertheless, additional hints and best practices should be deployed

to further increase both the availability and performance of your infrastructure:

  • Read-Replica in Cross-Region: create a read-replica in cross-region to maximize the availability. Whenever primary region outage happens, we can promote the read-replica as the master instance and get the DB instance available all the time.
  • General Purpose (SSD) storage for Consistent Performance: for small and medium database workloads, modify the Storage Type to the General Purpose (SSD) Storage for consistent IOPS delivery for Database operations.
  • Change the Instance type to Current Generation: change the RDS Instance type to Current Generation Instance Types: T2, M3, R3 as per your workload requirements. The newer generation instances will give us the best RAM, CPU, and Networking capabilities compared to the previous generation instance types T1, M1, C1, M2, etc.
  • Tune the MySQL Parameters of RDS as you Scale: When you Scale up and Scale down your RDS instance, there will be many parameters depending on your RDS IOPS, Memory, CPU and networking. Tune them accordingly, otherwise, it will lead to bad performance of the RDS instances.

Written by

Praveen Kumar Muppala

I have strong experience on Multiple Unix/Linux flavours, LAMP Stack, Monitoring Systems, Database, NoSQL. I love to explore the new concepts/services in Cloud Computing World. I have written 4 certifications in different flavours of Linux/Unix.

Related Posts

Sanket Dangi
— February 11, 2019

WaitCondition Controls the Pace of AWS CloudFormation Templates

AWS's WaitCondition can be used with CloudFormation templates to ensure required resources are running.As you may already be aware, AWS CloudFormation is used for infrastructure automation by allowing you to write JSON templates to automatically install, configure, and bootstrap your ...

Read more
  • AWS
  • formation
Andrew Larkin
— January 24, 2019

The 9 AWS Certifications: Which is Right for You and Your Team?

As companies increasingly shift workloads to the public cloud, cloud computing has moved from a nice-to-have to a core competency in the enterprise. This shift requires a new set of skills to design, deploy, and manage applications in cloud computing.As the market leader and most ma...

Read more
  • AWS
  • AWS certifications
Andrew Larkin
— November 28, 2018

Two New EC2 Instance Types Announced at AWS re:Invent 2018 – Monday Night Live

The announcements at re:Invent just keep on coming! Let’s look at what benefits these two new EC2 instance types offer and how these two new instances could be of benefit to you. If you're not too familiar with Amazon EC2, you might want to familiarize yourself by creating your first Am...

Read more
  • AWS
  • EC2
  • re:Invent 2018
Guy Hummel
— November 21, 2018

Google Cloud Certification: Preparation and Prerequisites

Google Cloud Platform (GCP) has evolved from being a niche player to a serious competitor to Amazon Web Services and Microsoft Azure. In 2018, research firm Gartner placed Google in the Leaders quadrant in its Magic Quadrant for Cloud Infrastructure as a Service for the first time. In t...

Read more
  • AWS
  • Azure
  • Google Cloud
Khash Nakhostin
Khash Nakhostin
— November 13, 2018

Understanding AWS VPC Egress Filtering Methods

In order to understand AWS VPC egress filtering methods, you first need to understand that security on AWS is governed by a shared responsibility model where both vendor and subscriber have various operational responsibilities. AWS assumes responsibility for the underlying infrastructur...

Read more
  • Aviatrix
  • AWS
  • VPC
Jeremy Cook
— November 10, 2018

S3 FTP: Build a Reliable and Inexpensive FTP Server Using Amazon’s S3

Is it possible to create an S3 FTP file backup/transfer solution, minimizing associated file storage and capacity planning administration headache?FTP (File Transfer Protocol) is a fast and convenient way to transfer large files over the Internet. You might, at some point, have conf...

Read more
  • Amazon S3
  • AWS
Guy Hummel
— October 18, 2018

Microservices Architecture: Advantages and Drawbacks

Microservices are a way of breaking large software projects into loosely coupled modules, which communicate with each other through simple Application Programming Interfaces (APIs).Microservices have become increasingly popular over the past few years. The modular architectural style,...

Read more
  • AWS
  • Microservices
Stuart Scott
— October 2, 2018

What Are Best Practices for Tagging AWS Resources?

There are many use cases for tags, but what are the best practices for tagging AWS resources? In order for your organization to effectively manage resources (and your monthly AWS bill), you need to implement and adopt a thoughtful tagging strategy that makes sense for your business. The...

Read more
  • AWS
  • cost optimization
Stuart Scott
— September 26, 2018

How to Optimize Amazon S3 Performance

Amazon S3 is the most common storage options for many organizations, being object storage it is used for a wide variety of data types, from the smallest objects to huge datasets. All in all, Amazon S3 is a great service to store a wide scope of data types in a highly available and resil...

Read more
  • Amazon S3
  • AWS
Cloud Academy Team
— September 18, 2018

How to Optimize Cloud Costs with Spot Instances: New on Cloud Academy

One of the main promises of cloud computing is access to nearly endless capacity. However, it doesn’t come cheap. With the introduction of Spot Instances for Amazon Web Services’ Elastic Compute Cloud (AWS EC2) in 2009, spot instances have been a way for major cloud providers to sell sp...

Read more
  • AWS
  • Azure
  • Google Cloud
  • SpotInst
Guy Hummel and Jeremy Cook
— August 23, 2018

What are the Benefits of Machine Learning in the Cloud?

A Comparison of Machine Learning Services on AWS, Azure, and Google CloudArtificial intelligence and machine learning are steadily making their way into enterprise applications in areas such as customer support, fraud detection, and business intelligence. There is every reason to beli...

Read more
  • AWS
  • Azure
  • Google Cloud
  • Machine Learning
Stuart Scott
— August 17, 2018

How to Use AWS CLI

The AWS Command Line Interface (CLI) is for managing your AWS services from a terminal session on your own client, allowing you to control and configure multiple AWS services.So you’ve been using AWS for awhile and finally feel comfortable clicking your way through all the services....

Read more
  • AWS