SRE Engineer Key Responsibilities: System Monitoring & Incident Response: Develop and implement monitoring tools to
ensure system health. Respond to incidents, troubleshoot issues, and provide timely
resolutions. Automation & Infrastructure as Code: Design and implement automation solutions to
manage infrastructure and application deployment using tools like Terraform, Ansible, or
similar technologies.
Performance Optimization: Analyze system performance and capacity; implement
improvements to enhance system reliability and efficiency.
Collaboration: Work closely with development teams to improve system design and
deployment practices. Advocate for reliability improvements in the software development
lifecycle.
Documentation & Reporting: Maintain thorough documentation of system architecture,
processes, and incident response procedures. Provide regular reports on system performance
and reliability metrics.
Disaster Recovery & Backup: Design and implement disaster recovery plans and ensure
effective data backup solutions are in place.
Security Best Practices: Collaborate with security teams to ensure best practices are
followed to protect systems and data.
Qualifications:
Bachelors degree in Computer Science, Engineering, related field, or equivalent experience.
Proven experience in a Site Reliability Engineering, DevOps, or related role.
Strong knowledge of cloud services (AWS, Azure, Google Cloud) and container orchestration
(Kubernetes, Docker).
Proficiency in scripting languages (Python, Bash, ansible, etc.) and experience with CI/CD
tools (Jenkins, GitLab CI/CD, etc.) and infrastructure as code tools (Terraform, Ansible).
3+ years of proven track record with production monitoring using Prometheus, ELK, Grafana
and OpsGenie/PagerDuty.
3+ years of experience in Linux system administration (preferably Ubuntu)
Solid understanding of networking, security, system architecture, and data center operations
in a fast-paced, 24x7, production environment
Strong understanding of networking concepts, protocols (TCP/IP, BGP, OSPF), and
technologies (LAN, WAN, VPN) with proficiency in network monitoring tools and software.
Excellent problem-solving skills and a proactive mindset with excellent communication and
teamwork skills.
In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment
based on race, color, ancestry, national origin, religion, age, gender, marital domestic partner status,
sexual orientation, gender identity, disability status, or veteran status. Above and beyond
discrimination or harassment based on "protected categories," Pango Group is committed to being an
inclusive community where all feel welcome. Whether blatant or hidden, barriers to success have no
place at Pango Group.
As part of Pango Group, you will:
Solve real customer problems. Pango Groups point solutions allow consumers to address their immediate cyber
protection needs. Our mandate is to continuously anticipate our customers evolving digital security needs to create
best-in-class solutions aimed at keeping them safe.
See your impact. We are a scrappy, nimble organization where individual contributions are needed and valued.
You will see your impact every day.
Accelerate your career. As we expand, you will have the opportunity to learn new technologies, products, and
markets in a fast-paced, growth-oriented environment.
Most importantly, you'll get to work with other talented people at a company where people matter. If you want to
put your fingerprint on an organization and leapfrog your growth, this is the place for you.
In keeping with our beliefs and goals, no employee or applicant will face discrimination or harassment based on
race, color, ancestry, national origin, religion, age, gender, marital domestic partner status, sexual orientation,
gender identity, disability status, or veteran status. Above and beyond discrimination or harassment based on
"protected categories," Pango Group is committed to being an inclusive community where all feel welcome.
Whether blatant or hidden, barriers to success have no place at Pango Group.
Important privacy information for United States based job applicants can be found here.
Keyskills: Linux Networking Site Reliability Engineering Linux Administration Devops
Accion Labs is a Product Engineering Company Helping to Transform Businesses Through Emerging Technologies. This includes Web 2.0, Open Source, SaaS/Cloud, Mobility, IT Operations Management/ITSM, Big Data, and traditional BI/DW. Through nine global offices and a rapid-response delivery model, Accio...