We are looking for a Senior Kubernetes Platform Engineer with 10+ years of infrastructure experience to design and implement the Zero-Touch Build, Upgrade, and Certification pipeline for our on-premises GPU cloud platform. This role focuses on automating the Kubernetes layer and its dependencies (e.g., GPU drivers, networking, runtime) using 100% GitOps workflows. You will work across teams to deliver a fully declarative, scalable, and reproducible infrastructure stackfrom hardware to Kubernetes and platform services.
Key Responsibilities
Architect and implement GitOps-driven Kubernetes cluster lifecycle automation using tools like kubeadm, ClusterAPI, Helm, and Argo CD.
Develop and manage declarative infrastructure components for:
o GPU stack deployment (e.g., NVIDIA GPU Operator)
o Container runtime configuration (Containerd)
o Networking layers (CNI plugins like Calico, Cilium, etc.)

Keyskills: python kubernetes