Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Site Reliability Engineer @ Coforge

Home > Devops

 Site Reliability Engineer

Job Description

Location : Greater Noida, Pune, Hyderabad, Bangalore

About the Role

We are seeking an SRE Engineer focused on Observability, Kubernetes, and Cloud Infrastructure to support our large-scale GCP/AWS/EKS platform. This role is central to improving SLO reliability, logging pipelines, distributed tracing, dashboards, and automated diagnostics across 10,000+ applications running in EKS.


Responsibilities

  • Own observability stack: Prometheus/Grafana, Open Telemetry, Loki/ELK/Splunk, Jaeger, Alertmanager, SLO frameworks.
  • Build intelligent monitoring pipelines and ensure high reliability of metric ingestion, log ingestion, tracing, and analytics systems.
  • Develop Terraform modules for observability infrastructure, K8s components, cluster add-ons, and monitoring services.
  • Improve reliability of AWS/GCP/EKS clusters through automation, performance tuning, capacity modeling, and event-driven remediation.
  • Build AI-assisted diagnostics for anomaly detection, auto-alert tuning, automated playbooks, and noise reduction.
  • Partner with Platform Engineering to ensure Istio/service mesh telemetry, API server health, and node-level insights.
  • Lead operational readiness, SLO reporting, incident management, and root cause analysis for platform outages.

Qualifications

  • 48 years in SRE, Infrastructure, or Kubernetes operations.
  • Strong knowledge of EKS/ECS/GKE, Kubernetes internals, and cluster operations.
  • Expertise in observability stacks (Prometheus, OTel, Grafana, ELK, Datadog, Splunk).
  • Advanced Terraform IaC and automation skills (Python/Go preferred).
  • Experience with CI/CD, cloud networking, service mesh (Istio), and capacity planning.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: Coforge
Location(s): Hyderabad

+ View Contactajax loader


Keyskills:   Terraform Sre Site Reliability Engineering Kubernetes Prometheus

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Application Support Engineer

  • Accenture
  • 3 - 8 years
  • Ahmedabad
  • 5 days ago
₹ Not Disclosed

Custom Software Engineer

  • Accenture
  • 2 - 5 years
  • Hyderabad
  • 5 days ago
₹ Not Disclosed

DevOps Engineer

  • Accenture
  • 3 - 6 years
  • Pune
  • 5 days ago
₹ Not Disclosed

Aws Devops Engineer

  • Capgemini
  • 4 - 9 years
  • Bengaluru
  • 10 days ago
₹ Not Disclosed

Coforge

Coforge is a leading global IT solutions organization, enabling its clients to transform at the intersect of unparalleled domain expertise and emerging technologies to achieve real-world business impact. A focus on very select industries, a detailed understanding of the underlying processes of those...