710 years of experience in DevOps, Site Reliability Engineering (SRE), or Systems Engineering.
Strong troubleshooting expertise across OS, networking, applications, and cloud infrastructure.
Proficient in log analysis, using tools like grep, journalctl, GCP Logging, or ELK stack.
Proficient in scripting with Python, with experience building automation and tooling.
Deep understanding of Google Cloud Platform (GCP) or similar cloud environments (AWS/Azure).
Good knowledge of networking fundamentals TCP/IP, DNS, NAT, routing, firewalls.
Experience with bug fixing, issue isolation, and root cause analysis in production environments.
Experience with Kubernetes, Docker, and container lifecycle management.
Hands-on experience with Infrastructure as Code (IaC) and configuration management.
Preferred Qualifications:
GCP certifications (e.g., Professional DevOps Engineer, Associate Cloud Engineer).
Experience with service mesh, service discovery, and secure microservices communication.
Exposure to hybrid or multi-cloud environments and related tooling.
Familiarity with incident response procedures, SLOs, SLIs, and error budgets.
Strong communication skills and ability to work cross-functionally in fast-paced teams.
Job Classification
Industry: IT Services & Consulting Functional Area / Department: Engineering - Software & QA Role Category: DevOps Role: Site Reliability Engineer Employement Type: Full time