SRE Principles and Practices - Level 1

DifficultyBeginner
AVG Duration1h
Content
3

Description

Site Reliability Engineers act as the quality assurance check for operational systems and are a key element of organizational maturity within a growing Cloud IT organization. Site Reliability Engineering principles and practices are a core competency for any Cloud Operations team looking to improve their ability to consistently deploy and manage cloud applications across their services portfolio.

This learning path teaches the principles and practices of site reliability engineering to a level 1 standard. 

Learning Objectives

  • Be able to recognise and reduce Toil when managing production systems
  • Be able to describe key Site Reliability Engineering attributes, processes and workflows
  • Be able to understand key differences between SRE and DevOps
  • Be able to define and set Service Level Objectives and Error Budgets

Certificate

Your certificate for this learning path

Training Content

1
Course - Intermediate - 10m
SRE Principles and Practices
The course covers the principles and practices of site reliability engineering.
2
Course - Intermediate - 15m
SRE Reducing Toil
This course looks at how to reduce toil in site reliability engineering.
3
Course - Intermediate - 13m
SRE Service Level Objectives and Error Budgets
This course covers service level objectives and error budgets in site reliability engineering.
About the Author
Students114043
Labs65
Courses113
Learning paths152

Jeremy is a Content Lead Architect and DevOps SME here at Cloud Academy where he specializes in developing DevOps technical training documentation.

He has a strong background in software engineering, and has been coding with various languages, frameworks, and systems for the past 25+ years. In recent times, Jeremy has been focused on DevOps, Cloud (AWS, GCP, Azure), Security, Kubernetes, and Machine Learning.

Jeremy holds professional certifications for AWS, GCP, Terraform, Kubernetes (CKA, CKAD, CKS).