Job Description
Meet the Team
We are the Webex Persistence Team within the Webex Service Engineering SRE organization.
Our team designs, builds, and operates highly scalable persistence services powering Webex Teams, Webex Meetings, and the broader Webex Suite. We work across Webex Data Centers and AWS Cloud environments, ensuring high availability, reliability, and performance at a global scale.
As part of this team, you will work on mission-critical infrastructure and platform systems, including data, streaming, and persistence layers, with a strong focus on automation, scalability, and cloud-native engineering.
Your Impact
As a Senior Cassandra / DevOps Engineer, you will play a critical role in designing, operating, and optimizing the large-scale Cassandra clusters that support Webex services.
You will help build resilient, high-performance systems and drive engineering excellence in database and infrastructure operations.
- Design, implement, and manage large-scale Apache Cassandra clusters in production
- Optimize performance, troubleshoot issues, and ensure high availability of distributed systems
- Architect and deploy Cassandra solutions on AWS (EC2, EKS, managed services) and OCP environments
- Perform capacity planning, cluster sizing, and performance tuning
- Define data models including partitioning strategies, replication, and consistency levels
- Collaborate with engineering teams to build high-throughput, low-latency data systems
- Drive strategies for migrating Cassandra to new versions
- Establish backup, recovery, and disaster recovery processes
- Monitor system health and reliability using tools like Prometheus and Grafana
- Contribute to infrastructure automation using Terraform, Ansible, and Python
- 8+ years of hands-on experience in Apache Cassandra design, administration, and cluster management in production environments
- Strong expertise in Cassandra architecture including data modeling, partitioning, replication strategies, consistency levels, and repair mechanisms
- Proven experience in performance tuning, troubleshooting, and designing highly available distributed systems
- Experience deploying and operating Cassandra on cloud platforms such as AWS (EC2, EKS, VPC, S3, IAM, CloudWatch) and OCP
- Proficiency in Cassandra Query Language (CQL) and programming in Java or Python, along with experience using Cassandra drivers
- Experience with infrastructure automation and DevOps practices, including CI/CD tools (Jenkins, GitHub Actions, GitLab CI), Infrastructure as Code tools (Terraform, Ansible), and Python scripting
- Experience working with relational and search databases such as PostgreSQL, OpenSearch, or Elasticsearch
- Hands-on experience with containerization and orchestration technologies such as Docker and Kubernetes
- Exposure to modern data platforms, including data pipelines, streaming systems, or AI/ML and GenAI workloads
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time
Contact Details:
Company: Cisco
Location(s): Bengaluru
Keyskills:
Performance tuning
Automation
Data modeling
PostgreSQL
Disaster recovery
Apache
Cisco
Distributed systems
Python
Capacity planning