Reliability Engineering needs of mission critical systems and business processes
Candidate will assess high level architecture and design issues relating to platform enterprise software interactions with other systems
Application development infrastructure and middleware teams to ensure stability and reliability of the system Engineering will proactive detect issues within the applications platform network.
Candidate should have familiarity with Internet protocols such as HTTP DNS TCP and UDP and Linux development environment and well versed with DevOps.
Candidate will identify anti patterns optimization and support development of self-healing capabilities
Responsibilities Create operational tooling for monitoring self-healing infrastructures and testing
Design and create controlled in production systems
Work across teams identify and fix issues that affect systems reliability and performance
Dive into system and latent reliability issues service performance and capacity modeling of distributed systems at scale
Partner with development team to identify anti patterns and optimization strategies create fallback options and help develop self-healing capabilities across the enterprise in a sustainable manner
Requirements A passion for creating reliable applications and a systematic problem solving approach coupled with a strong sense of ownership and drive
3+ years of hands on experience with cloud-based technologies and tools in configuration management deployment monitoring and operations
Experience with Engineering tools such as Terraform, Ansible, Consul and Linux development environment.
Experience in Application Performance Managing Real User Monitoring infrastructure monitoring and log analysis tool such as Apica Nagios Sensu and Sumologic NewRelic with DevOps Continuous Delivery
Expertise in working in partnership with colleagues throughout the firm and in leading collaborative teams to achieve common goals
Experience in an Agile delivery environment
Experience as a hands on software engineer so you understand the core principles of the engineering work
Experience in communication and organization in large distributed teams
A Bachelor s degree is required
Employement Category:
Employement Type: Full timeIndustry: IT - SoftwareRole Category: General / Other SoftwareFunctional Area: Not ApplicableRole/Responsibilies: SRE Engineer