We've an Urgent requirement for Site Reliability Engineer
Build & run our cloud SaaS environments by monitoring availability, optimizing system performance and taking a holistic view of system health.
Provide primary operational support and engineering for multiple large distributed software applicationsBuild software and systems to manage and automate platform infrastructure and applications. Balance feature development speed and reliability with well defined service level objectivesImprove reliability, quality, and time to market of our suite of software as a service solution. Partner with development teams to improve services through rigorous testing and release procedures Participate in system design consulting, platform management, and capacity planning to create sustainable systems and services through automation and uplifts University degree in Computer Science or related discipline. Solid experience in implementing public cloud conceptsand models Programming experience with at least one modern language such asPython, Ruby, Go or Java including object oriented design Experiencewith distributed storage technologies likeS3as well as dynamic resource management frameworks asKubernetes, Mesos, DockerA proactive approach to spotting problems, areas for improvement, and performance bottlenecks Drive to standardize, streamline, and automate processes Experience in operating highly available services design of large scale distributed systems, preferably usingAWS technologies.
Keyskills: python ansible kubernetes gradle puppet karmajasmine mockito experience jenkins ci tool aws technologies s3