You are the operational owner for one or more customer-facing systems and coordinate all operational changes.
You are the expert in AWS cloud technologies and consult teams on available technologies, services and best practices.
You design your systems for maximum robustness and maximize the operational performance.
You facilitate a permanent knowledge exchange by sharing your experiences, training others and continuous learning.
Your skills:
As Site Reliability Engineer you aim to solve operational problems by software and have experience in the following areas:
Practical experience in operating customer-facing services in the Amazon AWS cloud environment, having an AWS Solutions Architect Professional certification or equivalent knowledge.
Azure knowledge is a plus
A reference project where you have optimized a system for scalability, performance and reliability.
Software development experience and programming skills in multiple scripting or higher languages.
You have designed comprehensive monitoring tools (Icinga, AWS Cloudwatch, etc.), to collect and analyse data for further system optimization.
Building infrastructure out of source code using Terraform.
Experience with distributed systems and complex network architecture DNS, firewalls, routing, tunnels are known to you.
Knowledge in Site Reliability concepts e.g. having an SRE foundation certification.
Linux server administration
Experience in agile software environments e.g. Scrum or SAFe
Software testing and designing test strategies is a plus
ITIL foundation certification is a plus
Language: Fluent English a must, other languages a plus.
Job Classification
Industry: IT Services & Consulting Functional Area: IT Services & Consulting Role Category: IT Infrastructure Services Role: IT Infrastructure Services - Other Employement Type: Full time