Lead Software Engineer, Cloud Site Reliability (SRE)Required Skills:
7 12 years of experience in CloudOps / SRE / NOC environments (24x7 operations)
Strong expertise in Azure Infrastructure (VMs, Networking, Storage)
Hands-on experience with Azure Kubernetes Service (AKS), Kubernetes, Docker
Strong experience with monitoring and observability tools (Datadog, Azure Monitor, Prometheus, Grafana)
Proven experience in Incident Management / Major Incident Handling, Monthly reporting
Experience with Infrastructure as Code (Terraform, ARM templates, Helm)
Scripting skills in PowerShell, Python, or Bash
Experience with ServiceNow (Incident, Problem, Change modules and dashboards)
Strong reporting and analytics experience using Power BI and exposure to tools like Power Automate
Good understanding of distributed systems and cloud-native architecture
Excellent communication, leadership, and problem-solving skills
Preferred Skills:
Experience in multi-cloud environments (AWS/GCP)
Exposure to AIOps / predictive monitoring / self-healing systems
Azure / Kubernetes certifications

Keyskills: Performance tuning Change management Operational excellence Networking Contract management Incident management Troubleshooting Analytics Capacity planning Python
This is a manufacturing, Trading and Retail Sales Company.\r\nProducts: Door, Chokhat, Plywood, Board etc.