Job Description
Lead Consultant Site Reliability Engineering
Function: Operation
Location: Mumbai, India
With a startup spirit and 90,000+ curious and courageous minds, we have the expertise to go deep with the world's biggest brands, and we have fun doing it. Now, we're calling all you rule-breakers and risk-takers who see the world differently and are bold enough to reinvent it. Come, transform with us.
Are you the one we are looking for?
Inviting applications for the role of Lead Consultant Site Reliability Engineering
In this role, you will work in close partnership with our internal clients, including Risk Managers, Collateral Operations, and the Reliability and Production Engineering teams, to improve the stability, reliability and supportability of our evolving platform. With a broad range of applications across the department, you will have the opportunity to collaborate, think big and deliver highly reliable, scalable, cloud-ready systems in an environment that provides the support and mentorship needed to learn and grow.
Responsibilities
- Apply software and systems engineering principles to build and run complex, distributed, scalable and fault-tolerant systems
- Implement highly observable, available and testable services
- Design, analyse, and troubleshoot complex legacy codebases to fix stability concerns in a pragmatic manner
- Debug, optimize code, automate routine tasks and reduce tech debt with a focus on improving efficiency and reducing costs
- Scale systems sustainably through mechanisms like automation; evolve systems by pushing for changes that improve reliability and velocity
- Practice sustainable incident response and blameless postmortems
Qualifications we seek in you
Minimum qualifications
- Bachelor's degree in Engineering/Technology with a Computer Science background from a top-tier university
- Excellent written and verbal communication skills
Preferred qualifications
- Programming: Proficiency in Core Java 8 or above, with knowledge of frequently used APIs, transports and frameworks (e.g. Spring, CXF, JSON, XML, Swagger, MQ, PubSub, REST, Kafka, Log4j)
- Scripting: Python, Perl, JavaScript, Unix shell/bash scripting
- Testing: Knowledge of Unit and Integration tests with experience in writing automated tests using JUnit, Mockito, Cucumber or similar testing frameworks
- Site Reliability Engineering (SRE): Experience in building and working on software systems that function correctly and operate in a failure-free manner, tolerate faults, withstand disruption and recover gracefully without downtime, and are highly observable, with monitoring and tracking of QoS parameters such as availability, latency, quality, correctness, completeness and error budget, using service level indicators (SLIs) mapped to service level objectives (SLOs)
- Knowledge of enterprise tooling related to monitoring and alerting with monitoring as code: AppDynamics, Prometheus, Jaeger, Grafana, Splunk, Ansible, Jenkins, GitHub, SockEye, PagerDuty
- Understanding of containers and virtualization with Infrastructure as code (Docker, Kubernetes and OpenShift)
- Knowledge of distributed Caching frameworks like Hazelcast or Gemfire
- Experience in developing enterprise-scale n-tier applications in the financial domain
- Systematic problem-solving approach, coupled with effective communication skills and a sense of drive
- Software Engineering: Understanding of fundamental Computer Science concepts and software engineering best practices, with a focus on Event Driven Architecture and design using Microservices
- Systems Engineering: Familiarity with Unix/Linux operating systems, with knowledge of shell/bash scripting
- Database Engineering: Experience with relational/NoSQL databases, with working knowledge of SQL for DDL/DML operations
- Site Reliability Engineering (SRE): Familiarity with fundamental concepts related to SRE like Reliability, Resiliency, Recoverability, Fault Tolerance, Elasticity, SLO/SLI, Error budgets, Observability and Monitoring and metrics related to these.
- Release Engineering: Familiarity with CI/CD pipelines, blue/green deployments, canarying, and feature-flag-based releases
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values diversity and inclusion, respect and integrity, customer focus, and innovation. For more information, visit www.genpact.com. Follow us on Twitter, Facebook, LinkedIn, and YouTube.
Employment Category:
Employment Type: Full time
Industry: IT - Software
Role Category: General / Other Software
Functional Area: Not Applicable
Role/Responsibilities: Lead Consultant Site Reliability Engineering
Contact Details:
Company: Genpact India
Location(s): Mumbai