Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Kafka Site Reliability Engineer @ NetApp

Home > Devops

 Kafka Site Reliability Engineer

Job Description

Job Summary

Our TechOps Engineers are the frontline team keeping our large fleet of cloud-hosted Apache Kafka, Cassandra, OpenSearch, Cadence, Valkey, Clickhouse and PostgreSQL clusters up and running. Every day you will diagnose and solve challenging and interesting technical problems providing a service that is relied on by some of the leading global names in tech to deliver for millions of end users.  Our service is relied on by some of the leading global names in Banking and Financial Services, Telecom, IoT and Tech companies that interact with millions of end users. 

This role is for Tech-ops engineer, primarily focusing on Apache Kafka Opensource technology - that includes operating, maintaining, upgrading and continuously improving the Managed Service for Kafka (across AWS, Azure and GCP) to deliver a great customer experience. This role includes participating in a rotating Level-2 roster. 

Job Requirements
  • Good Cloud operational knowledge (AWS or Azure or GCP) 
  • Preferably have past IT Customer service/support experience. 
  • Strong knowledge and experience with Linux and be comfortable working from the command line (essential)  
  • Good fundamental Computer science / software engineering skills and knowledge, particularly Operating System internals, memory management, and networking. 
  • Investigating/researching Kafka issues by reviewing the Apache Kafka codebase or Kafka Jira project would be a plus.   
  • Programming skills in Python or Java, and source code control using Git would be a plus.    
  • Be a proactive, reliable, and supportive member of the Technical Operations team for Kafka, and participate in a rotating L2 shift roster.
  • Provide expert operational support to our nodes running in the cloud (AWS, Azure, and GCP), using technologies such as Linux (Debian), Docker, and languages including Java, Python and bash. 
  • Liaise with our customers engineers in resolving interesting issues related to Apache Kafka usage and other supported technologies. 
  • Undertake complex cluster operations such as migrations, upgrades, and maintenance on our fleet. 
  • Develop and continually improve our suite of internal automation tools, applications, and processes.
Education
  • A minimum of 5 years of experience is mandatory, while 5 to 8 years of experience is highly desirable.
  • A Bachelor of Science Degree in Electrical Engineering or Computer Science, a Master's Degree, a PhD, or equivalent experience is required..

Job Classification

Industry: Software Product
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employement Type: Full time

Contact Details:

Company: NetApp
Location(s): Bengaluru

+ View Contactajax loader


Keyskills:   customer service project source site reliability engineering docker cloud java git apache automation tools automation postgresql computer science gcp linux debian software engineering programming jira python issue microsoft azure control cassandra kafka investigation bash aws

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

DevOps Engineer - L4

  • Wipro
  • 5 - 8 years
  • Hyderabad
  • 8 hours ago
₹ Not Disclosed

DevOps Engineer - L3

  • Wipro
  • 3 - 5 years
  • Bengaluru
  • 10 hours ago
₹ Not Disclosed

DevOps Engineer - L4

  • Wipro
  • 5 - 8 years
  • Bengaluru
  • 18 hours ago
₹ Not Disclosed

DevOps Engineer - L4

  • Wipro
  • 5 - 8 years
  • Pune
  • 23 hours ago
₹ Not Disclosed

NetApp

NetApp