Monitor system performance and availability using tools like Prometheus, Grafana, AppDynamics, or similar
Actively participate in on-call rotation, incident response, and root cause analysis to maintain system health
Able to manage & maintain infrastructure in cloud or hybrid environments, i.e., GCP & IKP
Should understand CI/CD pipelines and deployments
Continuously work on improving system observability, alerting, and logging mechanisms
Qualifications:
Openness to a 24x7 project environment (shift rotation is required)
2-4 years of professional experience in a Production Engineering, DevOps, or similar role
Excellent communication skills are mandatory
A basic understanding of SRE concepts will be an added advantage
Job Classification
Industry: Banking
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time