Skills Required:
Previous experience of Site Reliability, DevOps, or SaaS/Technical Operations in production
Large scale production UNIX/Linux operating system, application, and security maintenance
Strong networking basics and understanding of Azure networking services and CI/CD workflow for AKS clusters
Cloud resource orchestration using Terraform and configuration management using Ansible
Proficient with GitHub Actions to automate deployment pipeline and related workflows.
Ability to automate tasks using Bash and/or Python
Very good understanding of Azure Security policy, Azure DevOps, AKS, Azure WAF, Istio service mesh and ingress controller, etc.
Azure experience in a production environment, Kubernetes experience with Docker
Participate in technical discussion, provide consultative services to customer, guide the team based on requirements for delivery
Responsibilities:
Own operational availability, security, scalability, efficiency, monitoring, instrumentation, and overall service reliability of the environment for assigned project.
Collaborate across Agile teams with Architects, Developers, Quality, Data, Security, and other Operations engineers on designing and implementing highly reliable solutions.
Embrace Site Reliability Engineering principles of proactivity, automation, cross-functional collaboration, data-driven decision making, and fast+safe failover, to continually improve our technology and culture.
Enhance our infrastructure, tooling, and processes to extend operability as a self-service function for other groups in the engineering value stream.
Participate in a rotating on-call schedule to troubleshoot and resolve production escalations.
Skills that will add value:
Experience on CSPM and vulnerability management
Well versed with DevOps tools such as Jenkins, helm
Experience of Terraform Cloud
Ability to write code in at least one programming language (e.g., Python, Perl, Java, Ruby, Go)
Monitoring Tool - Grafana / Prometheus, ELK and DataDog
Experience managing large scale Kafka clusters
Keyskills: security policy project administration reliability engineering configuration management java perl ruby bash mesh cloud agile azure python devops github jenkins security pipeline schedule datadriven decision making
Xoriant Corporation is a product engineering and services company, serving technology startups as well as mid-size to large corporations. We offer a flexible blend of onsite, offsite and offshore services from our Global delivery centers ( Sunnyvale, New Jersey, Mumbai, Pune, Gurgaon, Kolkata and B...