Lead Site Reliability Engineer - GCP Ops, Terraform, Python, Etc. @ Optum

Optum
1 - 6 years
Bengaluru
3 months ago

Job Description

Position Overview:

We are seeking a motivated and detail-oriented Site Reliability Engineer (SRE) to help us improve the reliability, scalability, and performance of our systems. As an SRE, you will collaborate with cross-functional teams to design, build, and maintain the infrastructure and tools that support our applications. This is an excellent opportunity for someone who is passionate about DevOps, automation, and cloud-native technologies.

Primary Responsibilities:

Design, deploy, and maintain Kubernetes-based infrastructure to ensure high availability and scalability of applications
Build and manage CI/CD pipelines using GitHub Actions to enable fast and reliable deployments
Use Terraform to provision and manage infrastructure in Google Cloud Platform (GCP)
Manage and optimize Apache Kafka-based systems to ensure reliable message streaming and data processing
Monitor and improve system performance and reliability using Prometheus and Grafana
Collaborate with developers to automate workflows and implement best practices for infrastructure-as-code (IaC)
Write Python scripts for automation and tooling to enhance operational efficiency
Troubleshoot and resolve system issues to minimize downtime and impact on users
Participate in on-call rotations and incident response to ensure high service reliability
Comply with the terms and conditions of the employment contract, company policies and procedures, and any and all directives (such as, but not limited to, transfer and/or re-assignment to different work locations, change in teams and/or work shifts, policies regarding flexibility of work benefits and/or work environment, alternative work arrangements, and other decisions that may arise due to the changing business environment). The Company may adopt, vary or rescind these policies and directives in its absolute discretion and without any limitation (implied or otherwise) on its ability to do so

Required Qualifications:

Bachelor's degree in Computer Science, Information Technology, or related field (or equivalent work experience)
1+ years of experience in DevOps, SRE, or related roles (internships and project experience are acceptable for entry-level candidates)
Hands-on experience with Kubernetes for deploying and managing containerized applications
Experience with Apache Kafka for building, maintaining, and troubleshooting message-driven systems
Experience using Prometheus and Grafana for monitoring and observability
Familiarity with Google Cloud Platform (GCP) services such as Compute Engine, Kubernetes Engine, and Cloud Storage
Understanding of GitHub Actions for creating and maintaining CI/CD pipelines
Basic to intermediate knowledge of Terraform for infrastructure provisioning and management
Proficiency in Python for scripting, automation, and tooling
Solid problem-solving skills and an eagerness to learn new technologies
Excellent communication and teamwork skills

Preferred Qualifications:

Experience with debugging and optimizing distributed systems
Experience with Golang for developing infrastructure tools or cloud-native applications
Familiarity with other cloud providers (e.g., AWS or Azure)
Knowledge of Helm for Kubernetes package management
Exposure to security best practices for cloud infrastructure
Knowledge of Java for developing and troubleshooting backend systems
Familiarity with DataHub or similar data cataloging and metadata management platforms
Understanding of Artificial Intelligence (AI) concepts and tools, such as building or managing machine learning pipelines, integrating AI models, or working with ML platforms like TensorFlow, PyTorch, or Vertex AI

Job Classification

Industry: Retail
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time

Contact Details:

Company: Optum
Location(s): Bengaluru

Keyskills: kubernetes devops python sre aws continuous integration vertex golang reliability ci/cd hibernate helm artificial intelligence spring tensorflow java gcp pytorch debugging prometheus github microsoft azure machine learning grafana kafka terraform

Salary: Not Disclosed

Optum

About: OptumInsight India Pvt Ltd, a UnitedHealth Group company, is a leading health services and innovation company dedicated to helping make the health system work better for everyone. With more than 115,000 people worldwide, Optum combines technology, data and expertise to improve the delivery, ...
