Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Azure Data Engineer @ Infogain

Home > Software Development

 Azure Data Engineer

Job Description

  • Lead design and execution of Dataproc Databricks PySpark migration roadmap.
  • Define modernization strategy , including data ingestion, transformation, orchestration, and governance.
  • Architect scalable Delta Lake and Unity Catalog -based solutions.
  • Manage and guide teams on code conversion, dependency mapping, and data validation.
  • Collaborate with platform, infra, and DevOps teams to optimize compute costs and performance.
  • Own the automation & GenAI acceleration layer , integrating code parsers, lineage tools, and validation utilities.
  • Conduct performance benchmarking, cost optimization, and platform tuning (Photon, Auto-scaling, Delta Caching).
  • Mentor senior and mid-level developers, ensuring quality standards, documentation, and delivery timelines.
Technical Skills
  • Languages: Python, PySpark, SQL
  • Platforms: Databricks (Jobs, Workflows, Delta Live Tables, Unity Catalog), GCP Dataproc
  • Data Tools: Hadoop, Hive, Pig, Spark (RDD & DataFrame APIs), Delta Lake
  • Cloud & Integration: GCS, BigQuery, Pub/Sub, Cloud Composer, Airflow
  • Automation: GenAI-powered migration tools, custom Python utilities for code conversion
  • Version Control & DevOps: Git, Terraform, Jenkins, CI/CD pipelines
  • Other: Performance tuning, cost optimization, and lineage tracking with Unity Catalog
Preferred Experience
  • 10-14 years of data engineering experience with at least 3 years leading Databricks or Spark modernization programs.
  • Proven success in migration or replatforming projects from Hadoop or Dataproc to Databricks.
  • Exposure to AI/GenAI in code transformation or data engineering automation .
  • Strong stakeholder management and technical leadership skills.
EXPERIENCE
  • 11-12 Years

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: Infogain
Location(s): Noida, Gurugram

+ View Contactajax loader


Keyskills:   hive python technical leadership data validation performance tuning airflow pyspark apache pig data engineering artificial intelligence sql dataproc data bricks automation apache git stakeholder management spark gcp data ingestion hadoop bigquery

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Lead Engineer Software

  • Empower
  • 3 - 8 years
  • Bengaluru
  • 2 days ago
₹ Not Disclosed

Cloud Automation Engineer (AI Enablement)

  • Teamlease Digital
  • 6 - 10 years
  • Hyderabad
  • 2 days ago
₹ 1.25-2.5 Lacs P.A.

Cloud FinOps Engineer (AI-Enablement)

  • Teamlease Digital
  • 6 - 10 years
  • Hyderabad
  • 2 days ago
₹ 1-2 Lacs P.A.

Data Engineer

  • ConverseHR
  • 2 - 5 years
  • Hyderabad
  • 2 days ago
₹ 15-25 Lacs P.A.

Infogain

Infogain is a Silicon Valley headquartered company with software platform engineering and deep domain expertise in the travel, retail, insurance and high technology industries. We accelerate the delivery of digital customer engagement systems using digital technologies such as cloud, mic...