Site Reliability Engineers act as the quality assurance check for operational systems and are a key element of organizational maturity within a growing Cloud IT organization. Site Reliability Engineering principles and practices are a core competency for any Cloud Operations team looking to improve their ability to consistently deploy and manage cloud applications across their services portfolio.
This learning path teaches the principles and practices of site reliability engineering to a level 1 standard.
- Be able to recognise and reduce Toil when managing production systems
- Be able to describe key Site Reliability Engineering attributes, processes and workflows
- Be able to understand key differences between SRE and DevOps
- Be able to define and set Service Level Objectives and Error Budgets
Jeremy is a Content Lead Architect and DevOps SME here at Cloud Academy where he specializes in developing DevOps technical training documentation.
He has a strong background in software engineering, and has been coding with various languages, frameworks, and systems for the past 25+ years. In recent times, Jeremy has been focused on DevOps, Cloud (AWS, GCP, Azure), Security, Kubernetes, and Machine Learning.
Jeremy holds professional certifications for AWS, GCP, Terraform, Kubernetes (CKA, CKAD, CKS).