
Senior Engineer - Grafana + Python Specialist @ Xebia IT Architects



Job Description

Position: Senior Engineer, Observability & Automation (Grafana + Python Specialist)

Experience: 6+ years


Role Overview
We are seeking a highly skilled Senior Engineer with deep expertise in Grafana and Python to lead our observability, automation, and monitoring initiatives for cloud-native environments. This role involves end-to-end ownership of telemetry pipelines, real-time dashboards, alerting systems, and automated workflows, ensuring optimal system reliability and performance.


Key Responsibilities

  • Design, implement, and maintain advanced Grafana dashboards for infrastructure, application, and business metrics.
  • Build and optimize Python-based automation tools for metrics collection, log processing, health checks, and anomaly detection.
  • Integrate observability solutions with Azure Monitor, Log Analytics, Prometheus, and other telemetry backends.
  • Define and monitor SLIs/SLOs, implementing real-time alerting for proactive incident management.
  • Collaborate with Development and DevOps teams to instrument applications for full-stack visibility.
  • Optimize monitoring pipelines for performance, cost, and reliability.
  • Contribute to infrastructure automation and CI/CD processes using Python and Azure DevOps tools.
  • Lead best practices, tool selection, and training initiatives for observability standards across teams.

Required Skills

  • Grafana Expertise: Dashboard templating, multi-source integrations (e.g., Jira, SonarQube, Octopus), advanced alerting, and notifications (Email, Teams).
  • Python Proficiency: Automation scripting, API integrations, telemetry pipelines, CLI tool development.
  • Experience with containerized environments (Docker, Kubernetes, AKS).
  • Strong knowledge of distributed systems, tracing, and root cause analysis workflows.

Preferred Skills

  • Infrastructure-as-Code (Terraform, Helm, Bicep) for observability stack setup.
  • Exposure to time-series databases (InfluxDB, TimescaleDB).

Soft Skills

  • Strong problem-solving and analytical mindset.
  • Excellent communication and stakeholder engagement skills.
  • Passion for automation, system reliability, and continuous improvement.
  • Self-driven with a high sense of ownership.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DevOps
Role: Site Reliability Engineer
Employment Type: Full time

Contact Details:

Company: Xebia IT Architects
Location(s): Delhi, NCR



Key skills: Prometheus, Azure Monitoring, Grafana, Log Analytics, Python, Docker, Terraform, AKS, Incident Management, Helm, Kubernetes


₹ 20-30 Lacs P.A


Xebia IT Architects

Xebia IT Architects India Pvt Ltd. Xebia is a Dutch-headquartered IT company that specializes in Agile Coaching, Consulting & Transformation, Continuous Delivery & DevOps, Full Stack Agile Development, Big Data/Data Science, Mobile, Cloudification, and Data Centre Automation. With offi...