Monitor and Maintain Systems: Ensure the availability, performance, and reliability of our production environment by monitoring system health and responding to incidents.
Automation: Develop and implement automation tools to reduce manual intervention and improve system efficiency.
Collaboration: Work closely with development teams to design and implement scalable and reliable systems.
Performance Tuning: Analyze system metrics to identify performance bottlenecks and optimize system performance.
Incident Management: Lead incident response efforts, conduct root cause analysis, and implement preventive measures.
Documentation: Create and maintain comprehensive documentation for system architecture, processes, and procedures.
Capacity Planning: Conduct capacity planning and ensure systems can handle future growth
What You Know:
Experience: 9+ years of experience in site reliability engineering, operations, or software engineering.
Education: Bachelor s degree in computer science, Engineering, or a related field.
Technical Skills: Proficiency in scripting languages (e.g., Python, Ruby), experience with containerization (Docker, Kubernetes), and familiarity with cloud platforms (AWS, GCP, Azure).
System Knowledge: Strong understanding of Linux/Unix systems, networking, and infrastructure components.
Problem-Solving: Excellent troubleshooting and problem-solving skills.
Communication: Strong communication and collaboration skills to work effectively with cross-functional teams.
Certifications: Relevant certifications (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator) are a plus.
Education:
Bachelor s degree in computer science, Information Systems, Engineering, Computer Applications, or related field.
Benefits:
In addition to competitive salaries and benefits packages, Nisum India offers its employees some unique and fun extras:
Continuous Learning - Year-round training sessions are offered as part of skill enhancement certifications sponsored by the company on an as-need basis. We support our team to excel in their field.
Parental Medical Insurance - Nisum believes our team is the heart of our business and we want to make sure to take care of the heart of theirs. We offer opt-in parental medical insurance in addition to our medical benefits.
Activities -From the Nisum Premier Leagues cricket tournaments to hosted Hack-a-thon, Nisum employees can participate in a variety of team building activities such as skits, dances performance in addition to festival celebrations.
Free Meals - Free snacks and dinner are provided daily, in addition to subsidized lunch.
Job Classification
Industry: IT Services & ConsultingFunctional Area / Department: Engineering - Software & QARole Category: DevOpsRole: Site Reliability EngineerEmployement Type: Full time