All About You
As a Lead Software Engineer, you will be responsible for designing, implementing, and maintaining messaging systems and stateful event processing systems, and for contributing to observability, autonomics, and scalability.
As a Lead Observability Engineer, you will be responsible for designing, implementing, and maintaining our observability platform. You'll work closely with cross-functional teams to ensure our systems are transparent, measurable, and reliable. By leveraging your expertise in observability tools and techniques, you will help us gain deep insights into our applications, infrastructure, and user experiences.
Responsibilities:
Design and develop robust observability solutions to monitor, analyze, and troubleshoot highly distributed systems, especially message streaming systems and stateful event processing systems.
Familiarity with OpenTelemetry (OTEL) standards and tools.
Previous experience working with application teams to implement self-healing, i.e., alerting that triggers automated remediation.
Implement and configure monitoring, logging, tracing, and alerting systems to ensure comprehensive coverage of our infrastructure and applications.
Collaborate with software engineers to instrument code for telemetry data collection and analysis.
Optimize observability tooling and processes to improve system reliability, performance, and scalability.
Create dashboards, reports, and visualizations to provide actionable insights into system health and performance.
Investigate and resolve incidents by analyzing telemetry data and identifying root causes.
Stay current with industry trends and best practices in observability, and recommend improvements to our observability strategy and infrastructure.
Qualifications:
Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
5-10 years of experience as an Observability Engineer or in a similar role in a production environment.
Deep understanding of observability principles, methodologies, and tools such as Prometheus, Grafana, Jaeger, and the ELK stack.
Proficiency in programming/scripting languages like Java, Python, Go, or similar for automation and tooling development.
Strong practical knowledge of messaging and stream processing systems such as Kafka, Flink, and MQ.
Strong knowledge of cloud computing platforms (AWS preferred) and container orchestration systems (e.g., Kubernetes).
Excellent problem-solving skills and the ability to troubleshoot complex issues in distributed systems.
Strong communication skills and the ability to collaborate effectively with cross-functional teams.

Key skills: Computer science, Cloud computing, Automation, Payment systems, Information security, Data collection, Splunk, Distribution system, Application lifecycle management, Python
Who is Mastercard?
Mastercard is a global technology company in the payments industry. Our mission is to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart, and accessible. Using secure data and networks,...