We are hiring a "SRE [Site Reliability Engineer] Infrastructure Support" engineer with deep expertise in Linux, Kubernetes, and hardware infrastructure management for our "Enterprise-grade high-performance supercomputing" platform. We are helping enterprises and service providers build their Al inference platforms for end users, powered by our state-of-the-art RDU (Reconfigurable Dataflow Unit) hardware architecture. This is a high-impact, high-visibility role. The ideal candidate will play a pivotal role in supporting and maintaining our enterprise infrastructure stack, ensuring high availability and optimal performance across mission-critical Al & ML environments. This role involves close collaboration with global SRE and Platform teams to manage and troubleshoot enterprise systems and clusters.
Key Responsibilities:
Required Qualifications:
Soft Skills:
Why Join Us:

Keyskills: Linux Site Reliability Engineering Kubernetes Python Ansible CKA RHEL RHCE
About Aziro: Aziro is a trusted partner in Software Product Engineering Services and Digital Transformation projects, serving Fortune 100 companies, Silicon Valley-based ISVs, and global enterprises. Clientele & Global Presence: As an ISO 27001 and Great Place To Work Certified company, we co...