Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Site Reliability Engineer - Aziro - Immediate Joiner

Home > Devops

 Site Reliability Engineer - Aziro - Immediate Joiner

Job Description

We are hiring a "SRE [Site Reliability Engineer] Infrastructure Support" engineer with deep expertise in Linux, Kubernetes, and hardware infrastructure management for our "Enterprise-grade high-performance supercomputing" platform. We are helping enterprises and service providers build their Al inference platforms for end users, powered by our state-of-the-art RDU (Reconfigurable Dataflow Unit) hardware architecture. This is a high-impact, high-visibility role. The ideal candidate will play a pivotal role in supporting and maintaining our enterprise infrastructure stack, ensuring high availability and optimal performance across mission-critical Al & ML environments. This role involves close collaboration with global SRE and Platform teams to manage and troubleshoot enterprise systems and clusters.


Key Responsibilities:

  • Linux Administration: Manage,configure,and optimize Linux servers (RHEL, Ubuntu, or similar), including patching, security hardening, and performance tuning
  • Kubernetes Administration: Deploy, manage, and troubleshoot Kubernetes clusters,ensuring reliability and scalability.
  • Hardware Infrastructure Management: Oversee physical data center infrastructure,including servers, storage, and networking hardware.
  • Security & Compliance: Apply security patches and upgrades for Linux-based Kubernetes environments and ensure compliance with organizational policies.
  • Collaboration & Support: Work closely with SRE and Platform teams worldwide to support enterprise systems and clusters.
  • Ticket-Based Case Management: Handle tickets efficiently using tools such as Salesforce or ServiceNow.

Required Qualifications:

  • Strong hands-on experience with Linux system administration (RHEL, Ubuntu, or similar). RHCSA/RHCE certification is a plus.
  • Solid understanding of Kubernetes administration; CKA/CKS certification is a plus.
  • Hands-on experience with bare-metal and hardware infrastructure (servers, storage, networking).
  • Good understanding of networking concepts (TCP/IP, DNS, Load Balancers, Firewalls); knowledge of Juniper OS is a plus.
  • Strong troubleshooting skills across hardware, OS, and Kubernetes environments.
  • Knowledge of automation tools such as Ansible, Python, Bash, or similar is a plus.
  • Familiarity with monitoring and observability tools (Prometheus, Grafana, ELK) is a plus.

Soft  Skills:

  • Strong communication, problem-solving, and collaboration abilities.
  • Ability to work effectively in fast-paced, dynamic environments and adapt to evolving Al & ML technologies.
  • Proactive mindset with a focus on automation,scalability, and operational excellence.

Why Join Us:

  • Work on cutting-edge Al & ML infrastructure supporting mission-critical applications.
  • Collaborate with global teams and gain exposure to advanced cloud-native and enterprise techno logies.
  • Opportunity to grow your expertise in Linux, Kubernetes, and data center operations

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: Aziro
Location(s): Pune

+ View Contactajax loader


Keyskills:   Linux Site Reliability Engineering Kubernetes Python Ansible CKA RHEL RHCE

 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

DevOps Engineer L4

  • Wipro HR Soniya
  • 5 - 8 years
  • Pune
  • 3 days ago
₹ Not Disclosed

Azure Engineer

  • Cognizant
  • 5 - 8 years
  • Hyderabad
  • 8 days ago
₹ Not Disclosed

DevOps Engineer

  • Accenture
  • 3 - 6 years
  • Bengaluru
  • 9 days ago
₹ Not Disclosed

DevOps Engineer

  • Accenture
  • 3 - 6 years
  • Bengaluru
  • 9 days ago
₹ Not Disclosed

Aziro

About Aziro: Aziro is a trusted partner in Software Product Engineering Services and Digital Transformation projects, serving Fortune 100 companies, Silicon Valley-based ISVs, and global enterprises. Clientele & Global Presence: As an ISO 27001 and Great Place To Work Certified company, we co...