SRE Leader @ Trianz

Job Description

Job Overview

We are seeking a seasoned infrastructure leader to own and evolve our AWS cloud platform, the foundation that powers our business 24/7. In this role, you will lead a high-performing team of Cloud Ops and SRE engineers, driving operational excellence while shaping our cloud architecture strategy and security posture for scale.

This role goes beyond operations; you will influence how we build, secure, and run our infrastructure, bridging the gap between reliability, innovation, and security. If you thrive on building resilient systems, mentoring technical teams, and making strategic architecture decisions that impact the entire organization, this is the role for you.

Key Responsibilities

Operational Excellence at Scale

  • Lead a unified CloudOps/SRE team across L1/L2/L3 support, ensuring seamless 24x7 operations through structured shift rotations and escalation frameworks.
  • Drive incident management excellence, from first response to root cause analysis and continuous improvement.
  • Maintain and exceed operational KPIs: MTTA, MTTR, uptime SLAs, and availability objectives.
  • Oversee day-to-day operations across our AWS footprint: EC2, VPC, ELB/ALB, EKS/ECS, RDS/Aurora, S3, IAM, Lambda, CloudFront, and CloudWatch.

Architecture Leadership & Platform Evolution

  • Provide architectural oversight for production workloads, guiding teams on scalable, cost-optimized, and secure AWS designs.
  • Review and approve architecture patterns, deployment topologies, and infrastructure standards.
  • Partner with Cloud Architects to establish guardrails, reference architectures, and reusable Infrastructure-as-Code modules.
  • Create feedback loops where operational insights directly influence design decisions, ensuring we build for observability, resilience, and efficiency.
  • Champion modernization initiatives: containerization, serverless adoption, and edge optimization strategies.

Security Posture & Compliance

  • Own cloud security governance across IAM, network segmentation, encryption, logging, and compliance.
  • Drive continuous security monitoring using AWS Security Hub, GuardDuty, IAM Access Analyzer, Config, Inspector, and third-party CSPM tools.
  • Ensure automated remediation for vulnerabilities, misconfigurations, and security baseline drift.
  • Maintain compliance with SOC 2, ISO 27001, CIS Benchmarks, and customer-specific security requirements.
  • Lead operational security hygiene: identity lifecycle management, least privilege enforcement, secrets management, and patch compliance.
  • Coordinate cloud security incident response with tight CloudOps-SecOps integration.

Automation & Tooling Strategy

  • Drive automation and tooling adoption across Monitoring & Observability (CloudWatch, Elastic Stack, distributed tracing), Logging & Analytics (CloudWatch Logs, ELK, OpenSearch), ITSM (ServiceNow, Jira Service Management), and IaC & Automation (CloudFormation, Terraform, Python, Shell scripting, GitOps workflows).
  • Build self-healing operations through automated provisioning, scaling, failover, and compliance checking.

Governance & Continuous Improvement

  • Establish and refine operational playbooks, runbooks, SOPs, and change control frameworks.
  • Implement ITIL-aligned processes for change, problem, and incident management.
  • Drive continuous improvement through automation, operational analytics, and team feedback loops.

Strategic Partnership & Communication

  • Collaborate with engineering, architecture, security, DevOps, and product teams to maintain platform reliability.
  • Provide executive-level insights on operational health, incident trends, risks, and improvement opportunities.
  • Influence business continuity planning, cloud cost governance, and the infrastructure roadmap.

What You Bring

Experience & Leadership

  • 15 to 20 years in infrastructure/operations, with at least 8 years leading cloud or production operations teams.
  • Proven track record managing 24x7 support teams of 20+ engineers in high-availability AWS environments.
  • Experience scaling teams and operations while maintaining quality and reliability.

Technical Expertise

  • Deep knowledge of AWS architecture, networking, security, and distributed systems design.
  • Strong understanding of cloud security posture management, identity governance, and compliance frameworks (SOC 2, ISO 27001, CIS Benchmarks).
  • Expertise in incident management, SRE practices, reliability engineering, and operational KPIs.
  • Hands-on experience with Infrastructure-as-Code (Terraform, CloudFormation), automation, and GitOps workflows.

Strategic & Communication Skills

  • Ability to translate technical complexity into clear business impact for executive audiences.
  • Track record of building high-performing, collaborative teams.
  • Strong stakeholder management and cross-functional partnership capabilities.
  • Bias toward automation, continuous improvement, and operational excellence.

Why This Role Matters

As the technical leader, you will ensure our infrastructure is secure, resilient, and ready to scale with our business. Your decisions will directly impact system reliability, security posture, and operational efficiency across the organization. You will mentor engineers, influence architecture, and drive the strategic evolution of our cloud platform.

If you're ready to lead with impact, we'd love to talk.


Job Classification

Industry: IT Services & Consulting
Functional Area / Department: IT & Information Security
Role Category: IT Infrastructure Services
Role: IT Operations Management
Employment Type: Full time

Contact Details:

Company: Trianz
Location(s): Hyderabad


Key skills: Docker, Site Reliability Engineering, AWS, DevOps

Salary: ₹ Not Disclosed

Trianz

Trianz is a dynamic, growth-oriented firm that focuses on turnkey execution of strategic initiatives, combining a unique execution philosophy with business model, process transformation, and technology implementation capabilities. Our practices are centered on industry process and technology-led areas...