Managing for Failure in AWS

2m 25s

In this lesson, we will cover a few strategies for handling failures in AWS and recovering from disasters.

Learning Objectives

  • Fault isolation
  • Testing reliability
  • Disaster recovery
  • Designing for auto-recovery

Intended Audience

  • Those who are already familiar with AWS infrastructure and its main components
  • DevOps engineers will also benefit from this lesson by expanding their knowledge in the area of infrastructure resilience, auto-recovery, and testing


Prerequisites

  • EC2 operations
  • General AWS networking knowledge
  • Familiarity with Auto Scaling
  • Cloud storage solutions

About the Author

Carlos Rivas
Sr. AWS Content Creator

Software development has been my craft for over two decades. In recent years, I was introduced to the world of "Infrastructure as Code" and cloud computing.
I loved it! It re-sparked my interest in staying on the cutting edge of technology.

Colleagues regard me as a mentor and leader in my areas of expertise, and as the person to call when production servers crash and we need the app back online quickly.

My primary skills are:
★ Software Development (Java, PHP, Python, and others)
★ Cloud Computing Design and Implementation
★ DevOps: Continuous Delivery and Integration


Covered Topics