Own the cloud operations of the analytics platform and drive high-quality customer experience of the product by ensuring SLA and consistent performance.
Manage DevOps teams.
Mentor L2 teams in troubleshooting production issues
Enable Service Desk to do 24X7 L1 monitoring
Track incidents and perform RCA for every incident and focus on prevention.
Ensure SOP s are defined and implemented religiously.
Develop CI/CD pipelines and change management
Ongoing change management for scale
Communicate effectively with customer facing team to address their concerns on infrastructure availability, performance, and security
Collaboration with Engineering team:
Build Tools and Systems for Observability
Plan releases and patch deployments
Scale of applications and infrastructure to ensure best customer experience
Infrastructure Management:
Application performance monitoring and optimization
Application and cloud security management
Automation: IaC, Disaster Recovery, BCP etc.
Cost Optimization:
Monitor costs of all resources and trace them to respective accounts
Identify cost optimization improvements and ensure optimal costs without compromising on performance
Own Security of data, infrastructure, and applications. Ensure best practices are followed for HIPAA, ISO 27001 and SOC2 compliance
Systematically develop documentation for each data solution and make sure it is up to date and reflects current business rules and definitions.
Building strong partnerships with colleagues at all levels.
Key Responsibilities:
A Bachelor s Degree in Software Engineering or Information Technology
4+ years of experience in cloud operations, DevOps in Azure and AWS environment
1+ years experience in creating and managing Kubernetes clusters
Experience with terraform, ansible or similar
Experience with scripting and programming language (Python, Golang or similar)
Strong experience in deploying & operating large and complex applications on cloud
Understanding of CI/CD, observability, APM and security of cloud-based applications
Experience with open-source tools like Grafana, Prometheus, etc.
Exposure to architecture of cloud based ETL and Analytics tools
Experience of building a leading a small team
Knowledge of AGILE ways of working
The ability to analyse complex technical information
An awareness of current issues affecting the industry and its technologies
A meticulous and organized approach to work
A logical, analytical, and creative approach to problem-solving
A thorough, detail-oriented work style
Function well in a fast-paced, rapidly changing environment
Communicate effectively with people at all levels of the organization
Qualification:
Any Graduate
Job Classification
Industry: IT Services & ConsultingFunctional Area / Department: Engineering - Software & QARole Category: DevOpsRole: DevOps ManagerEmployement Type: Full time