Job Description
Paytm Risk team solves some of the most complex DevOps issues across the globe. Some of our DNA fingerprints are:
1) Cloud-First Paytm is a cloud-first company where we work on some of the largest cloud (AWS) workloads.
2) Scalability We tame scale. Our definition of scale is -- PBs of data, millions of requests per minute, few thousand microservices hosted on a few hundred thousand cores.
3) Innovation is our blood. Automation is our key Mantra. We work smartly -- We have designed one of the world's largest CI platforms, our own AWS cost management platform, EBS autoscale (Yes we downscale EBS without even an ms of outage), and so on.
4) Observability One of the best in-house designed observability systems processing a few 10s of millions of events per second and many more.
About the Role:
As a Lead DevOps Engineer, you will be responsible for driving continuous integration and deployment (CI/CD), managing cloud-based infrastructure, and ensuring high availability, scalability, and security of our applications. You will work with automation tools, observability solutions, and AI-driven enhancements to optimize DevOps workflows and improve operational efficiency. This role requires deep expertise in cloud cost optimization, infrastructure as code (IaC), monitoring, and zero-downtime deployments.
Key responsibilities:
1) Work with our engineering team to design and implement a highly available, scalable, cost-effective and secure infrastructure.
2) Develop a deep understanding of our complex architecture, automate infrastructure and deployment using code and ensure performance, reliability and uptime of every component of the system.
3) Improve observability of the system, troubleshoot production incidents, identify root causes and implement corrective and preventive measures.
4) Work with our Information security team to implement security fixes and make systems secure and compliant as per the guidelines.
5) Document and implement best practices and strategies around running Low-latency high-throughput applications in the Cloud.
6) Manage and improve our NoSQL/big-data infrastructure (Cassandra, EMR etc)
7) Participate in weekly On call rotation for the production systems.
Expectations/Requirements:
1) 3+ years of overall experience in DevOps.
2) Cloud expertise: AWS (Preferred) / Azure / Google Cloud.
3) Infrastructure as Code (IaC): Hands-on experience with Terraform for cloud resource provisioning.
CI/CD & Deployment Pipelines:
1) Experience with Jenkins, Bamboo, GitLab CI/CD.
2) Expertise in Zero Downtime Deployment strategies.
3) Strong knowledge of GitOps practices using ArgoCD.
Containerization & Orchestration:
1) Experience with Docker & Kubernetes (EKS/GKE/AKS).
Monitoring & Observability:
1) Strong experience with ELK Stack (Elasticsearch, Logstash, Kibana) for logging and observability.
2) Experience with Grafana & Prometheus for system monitoring.
Database Systems:
1) Experience in setting up and administering NoSQL databases including Cassandra and MongoDB.x
Messaging & Caching Systems:
1) Experience managing Kafka (production-level clusters).
2) Hands-on expertise in Redis (production-level clusters).
Scripting & Automation:
1) Python scripting for automation (preferred).
AI & Automation Tools:
1) Experience leveraging AI tools (ChatGPT, GitHub Copilot, etc.) to automate routine DevOps tasks and improve productivity.
2) Ability to integrate AI-driven solutions into DevOps workflows.
Cloud Cost Optimization:
1) Strong understanding of cost-saving best practices for cloud infrastructure.
2) Ability to mentor junior engineers and collaborate with cross-functional teams.
3) Superpowers/Skills That Will Help You Succeed in This Role:
4) High level of drive, initiative, and self-motivation.
5) Strong problem-solving skills with a growth mindset.
6) Excellent communication and stakeholder management.
7) Passion for automation and AI-driven efficiencies.
8) Willingness to experiment, innovate, and continuously improve.
Job Classification
Industry: Banking
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: DevOps Engineer
Employement Type: Full time
Contact Details:
Company: Paytm
Location(s): Bengaluru
Keyskills:
devops engineer
gke
kubernetes
python
github
ai
aks
iac
redis
nosql
microservices
docker
cassandra
grafana
devops
kafka
jenkins
gitlab
terraform
prometheus
big data
aws
mongodb
azure