learning path

SRE Principles and Practices - Level 1

Enhance your skill set
Develop essential skills for thriving in real-world scenarios.

Stay focused, stay committed
Boost your learning journey by enrolling: stay focused and consistent, and achieve your goals with ease.

Earn a certificate of completion
Show your skills and build your credibility by including it on your resume and LinkedIn profile.

Site Reliability Engineering (SRE) is a discipline that combines software engineering and systems engineering to build and run large-scale, highly available systems. The principles and practices of SRE are designed to ensure that a company's systems are reliable, efficient, and easy to operate.

One of the key principles of SRE is to treat system reliability as a software problem. This means that SRE teams focus on building automation and monitoring systems to prevent outages and quickly identify and fix any problems that do occur.

This course teaches the principles and practices of Site Reliability Engineering to a Level 1 standard.

Learning Objectives

  • Be able to describe key Site Reliability Engineering attributes, processes and workflows
  • Be able to understand key differences between SRE and DevOps
  • Be able to define and set Service Level Objectives and Error Budgets
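As a rough illustration of the last objective, the sketch below shows the arithmetic that usually sits behind an error budget: the budget is the share of requests (or time) that an SLO allows to fail. The function names, the 30-day window, and the sample figures are hypothetical and chosen for illustration only; they are not taken from the course material.

```python
def error_budget_fraction(slo_target: float) -> float:
    """Return the fraction of requests (or time) allowed to fail under an SLO.

    slo_target is expressed as a fraction, e.g. 0.999 for "99.9%".
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    return 1.0 - slo_target


def allowed_downtime_minutes(slo_target: float, window_days: int = 30) -> float:
    """Convert an availability SLO into minutes of permitted downtime.

    The 30-day default window is an assumption made for this example.
    """
    minutes_in_window = window_days * 24 * 60
    return error_budget_fraction(slo_target) * minutes_in_window


def budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Return the share of the error budget still unspent (negative if overspent)."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    allowed_failures = error_budget_fraction(slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Hypothetical service with a 99.9% availability SLO.
    print(f"Downtime budget: {allowed_downtime_minutes(0.999):.1f} minutes per 30 days")
    print(f"Budget remaining: {budget_remaining(0.999, 1_000_000, 250):.0%}")
```

Under these assumptions, a 99.9% SLO leaves roughly 43 minutes of downtime per 30-day window, and 250 failures out of one million requests would consume about a quarter of the budget.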

About the Author

Jeremy Cook
Content Lead Architect

Jeremy is a Content Lead Architect and DevOps SME at Cloud Academy, where he specializes in developing DevOps technical training documentation.

He has a strong background in software engineering and has been coding with various languages, frameworks, and systems for more than 25 years. Recently, Jeremy has focused on DevOps, Cloud (AWS, Azure, GCP), Security, Kubernetes, and Machine Learning.

Jeremy holds professional certifications for AWS, Azure, GCP, Terraform, and Kubernetes (CKA, CKAD, CKS).

Covered Topics