Cluster node failures are inevitable when running large clusters. This Lab teaches you how to detect, diagnose, and remedy Kubernetes cluster node failures. You will use tools included in Kubernetes, such as kubectl, as well as a variety of Linux operating system tools like systemctl, journalctl, and ssh to build a comprehensive Kubernetes troubleshooting toolkit. In addition to reacting to failures, the Lab points out some ways that you can proactively reduce the chance of failures when working with Kubernetes.
This Lab is valuable to anyone working with Kubernetes, but the content has been prepared considering topics described in the Certified Kubernetes Administrator (CKA) Exam Curriculum. Completion of the Lab will help you get hands-on experience, which is essential for passing the CKA exam.
Upon completion of this Lab, you will be able to:
You should be familiar with:
June 20th, 2023 - Resolved VCF issue
June 13th, 2023 - Updated Kubernetes version
Daniel is a Lab Operations Specialist with experience as an AWS Engineer and Operations Specialist. He holds the AWS DevOps Engineer Professional, AWS Developer Associate, AWS SysOps Administrator Associate, Certified Kubernetes Administrator, Microsoft Certified: Azure Administrator Associate, and HashiCorp Certified: Terraform Associate certifications. Daniel is focused on lab operations and enjoys continuously building his knowledge.