Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Senior Site Reliability Engineer @ Rfpio

Home > Devops

 Senior Site Reliability Engineer

Job Description

About the Role
Responsive is looking for a Senior Site Reliability Engineer (SRE) to improve system reliability, scalability, and operational efficiency. This role involves working on automation, monitoring, and performance optimization to ensure high availability of our SaaS platform.
What You ll Be Doing ?
  • Reliability & Scalability: Design and implement resilient, scalable, and highly available systems while ensuring continuous performance improvements.
  • Automation & Incident Response: Automate operational tasks, develop self-healing mechanisms, and reduce MTTR with proactive monitoring and observability tools like Splunk, Grafana, and Prometheus.
  • Deployment & Security: Plan and manage system releases, implement best security practices for authentication, certificates, and secret management, and ensure compliance with industry standards.
  • Performance Optimization: Conduct failure testing, optimize system performance, and enhance infrastructure stability through structured chaos testing.
  • Collaboration & Leadership: Work with cross-functional teams to improve development workflows, mentor engineers, and drive best practices in Site Reliability Engineering.
What We re Looking For ?
Education :
  • Bachelor s degree in Computer Science, Information Technology, or a related field.
Experience:
  • 5 to 8 years of experience in Site Reliability Engineering, DevOps, or Infrastructure Engineering .
Skills, Qualifications & Ability:
  • Experience in improving system availability and reducing MTTR through automation and monitoring.
  • Hands-on experience with any one of the cloud services (AWS, GCP, or Azure) .
  • Incident Response & Debugging: Understanding application logs, troubleshooting performance bottlenecks, and debugging issues in production.
  • Strong background in incident response, triage, and post-mortem analysis to enhance operational efficiency.
  • Ability to troubleshoot and optimize web applications .
  • Hands-on experience in developing and implementing self-healing systems for high availability.
  • Familiarity with chaos engineering principles to test and enhance system resilience.
  • Strong understanding of CI/CD pipelines and deployment automation like blue-green deployments and rolling updates.
  • Experience in monitoring, logging, and observability tools such as Splunk, Prometheus, or Grafana.
  • Expertise in release management, hotfix coordination, and system deployment strategies with minimal downtime.
  • Proven ability to optimize system performance using monitoring and observability tools.
  • Experience in mentoring teams and providing technical leadership in reliability engineering.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: Rfpio
Location(s): Coimbatore

+ View Contactajax loader


Keyskills:   Computer science Automation Debugging Data processing splunk Troubleshooting Information technology Release management Operations Monitoring

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Azure Devops Engineer

  • HCLTech
  • 5 - 10 years
  • Bengaluru
  • 1 day ago
₹ Not Disclosed

Devops Engineer

  • Cybage
  • 10 - 12 years
  • Pune
  • 7 hours ago
₹ Not Disclosed

Devops Engineer

  • Top Rated
  • 4 - 6 years
  • Mumbai
  • 8 hours ago
₹ 12-18 Lacs P.A.

Azure Devops Engineer

  • Planbee Strategy
  • 3 - 6 years
  • Kolkata
  • 8 hours ago
₹ .25-6 Lacs P.A.

Rfpio

Responsive is the global leader in Strategic Response Management software, transforming how organizations share and exchange critical information. The AI-powered Responsive Platform is purpose-built to manage responses at scale, empowering companies across the world to accelerate growth, mitigate ri...