Job Description
Responsibilities: Designing, implementing, and maintaining the overall DevOps infrastructure and tooling, ensuring scalability, reliability, and performance. Collaborating with cross-functional teams to understand project requirements, provide technical expertise, and implement efficient CI/CD pipelines. Developing and maintaining automated deployment processes, utilising tools such as Jenkins, Ansible, Docker, and Kubernetes. Managing and optimising cloud-based infrastructure on AWS, ensuring cost-effectiveness and scalability. Overseeing the deployment and management of both monolithic and microservices-based applications. Implementing monitoring and alerting solutions using Grafana, Prometheus, and other relevant tools to ensure high availability and performance of systems. Handling log management and analysis to identify and resolve issues quickly. Leading root cause analysis (RCA) efforts for production incidents and implementing preventive measures. . Continuously improving the DevOps processes, identifying bottlenecks, and implementing best practices. Mentoring and providing guidance to junior members of the DevOps team. Keeping up-to-date with industry trends and best practices in DevOps and cloud technologies. Design, deploy, and maintain bare metal servers in an on-premise environment. Ensure the stability, security, and scalability of the server infrastructure. Collaborate with hardware vendors for procurement, troubleshooting, and maintenance Implement and manage databases on-premise, ensuring high availability and optimal performance. Set up and maintain database replication for data synchronisation and failover. Implement robust backup and recovery strategies to prevent data loss. Monitor database performance and troubleshoot issues proactively. Collaborate with developers to optimize SQL queries and database schema for efficiency. Requirements: 5+ years of experience in DevOps or a related role, with a strong background in building and maintaining CI/CD pipelines. Deep understanding of networking concepts and protocols. Knowledge of security best practices for on-premise server and database environments. Extensive experience in managing on premise dedicated servers Proficiency in tools such as Jenkins, Ansible, Docker, and Kubernetes. Good to have experience with cloud platforms, particularly AWS. Strong knowledge of containerisation, microservices architecture, and orchestration. Familiarity with log management and analysis tools. Experience with monitoring and alerting solutions such as Grafana and Prometheus. Proven experience in root cause analysis (RCA) for production incidents. Excellent problem-solving skills and the ability to troubleshoot complex issues. Strong scripting and automation skills (e. g., Bash, Python). Solid understanding of agile methodologies and version control systems. Excellent communication and interpersonal skills, with the ability to collaborate effectively across teams
Employement Category:
Employement Type: Full time
Industry: Others
Role Category: Application Programming / Maintenance
Functional Area: Not Applicable
Role/Responsibilies: DevOps Dedicated Engineer
Keyskills:
DevOps
Jenkins
Ansible
Docker
Kubernetes
AWS
Networking
Security
Containerization
Microservices
Orchestration
Log Management
Root Cause Analysis
Scripting
Automation
Agile Methodologies
Version Control
CICD
Grafana
Prometheus