Amazon AWS: RAID 0 Configuration on EBS Volumes

Amazon Web Services EBS volumes provide block-level storage for Amazon EC2 instances. They are highly available, reliable storage volumes that can be attached to EC2 instances in the same Availability Zone over the AWS internal network. Elastic Block Store (EBS) is backed by network-attached block storage, so the I/O performance of an EBS volume depends on the network throughput between the instance and the volume.
There are two kinds of EBS volumes available:
Standard Volumes offer cost-effective storage and deliver approximately 100 IOPS on average, with bursts of hundreds of IOPS. Standard Volumes range in size from 1 GB to 1 TB.
Provisioned IOPS (PIOPS) Volumes are designed for I/O-intensive workloads, such as databases, that need consistently high I/O throughput. You specify an IOPS rate when you create the volume. PIOPS Volumes range in size from 10 GB to 1 TB.
The I/O performance of both Standard and Provisioned IOPS Volumes depends on the volume size and the instance type.
RAID on EBS Volumes

To get both faster I/O performance and volumes larger than 1 TB, you can stripe two or more EBS volumes together as a RAID 0 array.
Advantage: With RAID 0 striping, I/O is distributed across the volumes in the stripe. Each volume you add to the array contributes its own IOPS and its own capacity to the total.
To take advantage of RAID 0 striping, the instance type should offer high network performance (for example, m3.xlarge, c3.2xlarge, or r3.2xlarge).
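The additive behaviour described above can be written as simple arithmetic. The following is a minimal sketch, not a benchmark: the function name raid0_estimate is made up for illustration, and the result is a theoretical upper bound that assumes identical volumes and an instance whose network bandwidth is not the bottleneck.

```shell
#!/bin/sh
# raid0_estimate COUNT IOPS_PER_VOLUME SIZE_GB_PER_VOLUME
# Hypothetical helper: prints the theoretical totals for a RAID 0 array
# built from COUNT identical volumes (real results depend on the instance type).
raid0_estimate() {
    count="$1"; iops="$2"; size_gb="$3"
    echo "volumes=$count total_iops=$((count * iops)) total_size_gb=$((count * size_gb))"
}

# Two 5 GB Standard Volumes at roughly 100 IOPS each, as used later in this article.
raid0_estimate 2 100 5
```

For the two-volume setup used in this article, this prints volumes=2 total_iops=200 total_size_gb=10.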
Disadvantage: The loss of a single volume results in the loss of all data in the array.
To guard against data loss, take snapshots of the RAID volumes frequently, and suspend writes while taking the snapshots so that the snapshots of all member volumes are consistent with one another.
Here we discuss how to configure RAID 0 striping across two EBS volumes and how to restore the array on several Linux distributions. You can extend this to any number of volumes as your requirements dictate.

Create RAID Configuration

  1. Create two EBS volumes of 5 GB each and attach them to the Amazon Linux instance.
  2. Create a RAID 0 striped array from the two attached volumes:
    mdadm --create /dev/md0 --level=0 --raid-devices=2 /dev/xvdc /dev/xvdd
    (This assumes the two EBS volumes are attached as /dev/xvdc and /dev/xvdd, respectively.)
  3. Format the RAID volume with your preferred file system:
    mkfs.ext4 /dev/md0
  4. Create a temporary mount point and mount the array on it:
    mkdir -p /srv/raid0
    mount /dev/md0 /srv/raid0
  5. Create some dummy data in the /srv/raid0 directory:
    cd /srv/raid0
    dd if=/dev/zero of=file1 bs=1024 count=65530
    Each run writes about 64 MB, so repeat it with a new output file name each time (or raise count) until about 9 GB has been written to the RAID meta device. Because the array stripes data across the two 5 GB volumes, both volumes then contain data.
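
The steps above can be collected into one script. This is a sketch under the article's assumptions (volumes attached as /dev/xvdc and /dev/xvdd, mount point /srv/raid0); the run and create_raid0 helpers and the DRY_RUN variable are names invented for this example, not part of mdadm. DRY_RUN defaults to 1, so the script only prints the commands it would execute.

```shell
#!/bin/sh
# Sketch: build a RAID 0 array, format it, mount it, and show its details.
# DRY_RUN defaults to 1 (print only); set DRY_RUN=0 and run as root to execute.
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# create_raid0 MD_DEVICE MOUNT_POINT MEMBER_DEVICE...
create_raid0() {
    md_dev="$1"; mount_point="$2"; shift 2
    # After the shift, "$#" is the number of member devices.
    run mdadm --create "$md_dev" --level=0 --raid-devices="$#" "$@"
    run mkfs.ext4 "$md_dev"
    run mkdir -p "$mount_point"
    run mount "$md_dev" "$mount_point"
    run mdadm --detail "$md_dev"    # report array level, size, and member devices
}

create_raid0 /dev/md0 /srv/raid0 /dev/xvdc /dev/xvdd
```

Review the printed commands first, then rerun with DRY_RUN=0 as root once the device names match your instance.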

Snapshot RAID volumes

  1. Create snapshots of the two volumes after suspending writes on those volumes.
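
One way to suspend writes is to freeze the file system with fsfreeze while the snapshots are started. The sketch below assumes the array is mounted on /srv/raid0, that the AWS CLI is installed and configured, and that the volume IDs shown are placeholders to be replaced with your own. The run and snapshot_raid helpers and the DRY_RUN variable are names invented for this example; DRY_RUN defaults to 1, so the commands are only printed.

```shell
#!/bin/sh
# Sketch: freeze the RAID file system, start a snapshot of each member volume, then thaw it.
# DRY_RUN defaults to 1 (print only); set DRY_RUN=0 and run as root to execute.
DRY_RUN="${DRY_RUN:-1}"

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "+ $*"
    else
        "$@"
    fi
}

# snapshot_raid MOUNT_POINT VOLUME_ID...
snapshot_raid() {
    mount_point="$1"; shift
    run fsfreeze -f "$mount_point"    # flush pending writes and block new ones
    for vol in "$@"; do
        run aws ec2 create-snapshot --volume-id "$vol" --description "raid0 member $vol"
    done
    # No "set -e" is used, so the thaw below runs even if a snapshot call fails.
    run fsfreeze -u "$mount_point"    # resume writes
}

# Placeholder volume IDs (hypothetical); substitute the IDs of your two EBS volumes.
snapshot_raid /srv/raid0 vol-0aaaaaaaaaaaaaaaa vol-0bbbbbbbbbbbbbbbb
```

The file system is thawed as soon as both snapshot requests have been issued, so writes stay suspended only briefly.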

Restore RAID Volumes

  1. Create new EBS volumes from the snapshots created earlier.
  2. Launch an EC2 instance running either Amazon Linux or CentOS and attach the newly created EBS volumes.
  3. On these distributions the OS assembles the RAID meta device automatically. Use the fdisk -l command to check. You can then mount the device on a temporary mount point and verify that the 9 GB of data is available.
  4. Now, launch an EC2 instance running Ubuntu and attach the newly created EBS volumes.
  5. Run the fdisk -l command and check whether the OS has assembled the RAID meta device automatically. If not, run the following command to assemble it:
    mdadm --assemble /dev/md0 /dev/xvdc /dev/xvdd
  6. Now you can see a meta device with the name /dev/md0. You can mount it and check its contents.
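
Besides fdisk -l, the kernel reports assembled arrays in /proc/mdstat. The following sketch lists them; list_md_arrays is a helper name invented for this example, and it assumes the usual mdstat layout in which each array line starts with a name such as md0.

```shell
#!/bin/sh
# Sketch: print the device path of every md array found in /proc/mdstat-style
# text read from stdin (array lines look like "md0 : active raid0 ...").
list_md_arrays() {
    awk '/^md[0-9]+ :/ { print "/dev/" $1 }'
}

# Query the live system only when /proc/mdstat is readable.
if [ -r /proc/mdstat ]; then
    list_md_arrays < /proc/mdstat
fi
```

If nothing is printed, assemble the array with the mdadm --assemble command from step 5. If a device is printed, mount that device; an automatically assembled array may appear under a different name than /dev/md0, such as /dev/md127.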

I hope this article helps you manage RAID configurations on EBS volumes and get the maximum IOPS from them.

Written by

Praveen Kumar Muppala

I have strong experience with multiple Unix/Linux flavours, the LAMP stack, monitoring systems, databases, and NoSQL. I love exploring new concepts and services in the cloud computing world. I have written 4 certifications in different flavours of Linux/Unix.
