Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Azure Data Engineer (Senior) @ Infogain

Home > Software Development

 Azure Data Engineer (Senior)

Job Description

Key Responsibilities

  • Analyze existing Hadoop, Pig, and Spark scripts from Dataproc and refactor them into Databricks-native PySpark.

  • Implement data ingestion and transformation pipelines using Delta Lake best practices.

  • Apply conversion rules and templates for automated code migration and testing.

  • Conduct data validation between legacy and migrated environments (schema, count, and data-level checks).

  • Collaborate on developing AI-driven tools for code conversion, dependency extraction, and error remediation.

  • Ensure best practices for code versioning, error handling, and performance optimization.

  • Participate in UAT, troubleshooting, and post-migration validation activities.

Technical Skills

  • Core: Python, PySpark, SQL

  • Databricks: Delta Lake, Unity Catalog, Databricks Workflows, MLflow (basic understanding)

  • GCP: Dataproc, BigQuery, GCS, Composer/Airflow, Cloud Functions

  • Data Engineering: Hadoop, Hive, Pig, Spark SQL

  • Automation: Experience with migration utilities or AI-assisted code transformation tools

  • CI/CD: Git, Jenkins, Terraform (preferred)

  • Validation: Data comparison utilities (Delta-to-Delta, DataFrame diffing, schema validation)

Preferred Experience

  • 5-8 years in data engineering or big data application development.

  • Hands-on experience migrating Spark or Hadoop workloads to Databricks.

  • Familiarity with Delta architecture, data quality frameworks, and GCP cloud integration.

  • Exposure to GenAI-based tools for automation or code refactoring is a plus.

EXPERIENCE

  • 6-8 Years


SKILLS

  • Primary Skill: Data Engineering
  • Sub Skill(s): Data Engineering


  • Additional Skill(s): Python, Apache Hadoop, Apache Hive, Apache Airflow, synapse, databricks, SQL, Apache Spark, Azure Data Factory, Pyspark, GenAI Fundamentals, Cloud Pub/Sub, BigQuery



ABOUT THE COMPANY

Infogain is a human-centered digital platform and software engineering company based out of Silicon Valley. We engineer business outcomes for Fortune 500 companies and digital natives in the technology, healthcare, insurance, travel, telecom, and retail & CPG industries using technologies such as cloud, microservices, automation, IoT, and artificial intelligence. We accelerate experience-led transformation in the delivery of digital platforms. Infogain is also a Microsoft (NASDAQ: MSFT) Gold Partner and Azure Expert Managed Services Provider (MSP).


Infogain, an Apax Funds portfolio company, has offices in California, Washington, Texas, the UK, the UAE, and Singapore, with delivery centers in Seattle, Houston, Austin, Krak w, Noida, Gurgaon, Mumbai, Pune, and Bengaluru.








Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: Infogain
Location(s): Noida, Gurugram

+ View Contactajax loader


Keyskills:   Telecom Automation Healthcare Application development Data quality MSP microsoft Troubleshooting SQL Python

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Custom Software Engineer

  • Accenture HR Aditi
  • 3 - 8 years
  • Noida, Gurugram
  • 11 hours ago
₹ Not Disclosed

Custom Software Engineer

  • Accenture HR Aditi
  • 3 - 8 years
  • Noida, Gurugram
  • 15 hours ago
₹ Not Disclosed

Application Architect-Azure Cloud Migration

  • IBM
  • 3 - 8 years
  • Pune
  • 15 hours ago
₹ Not Disclosed

Custom Software Engineer

  • Accenture HR Aditi
  • 3 - 8 years
  • Noida, Gurugram
  • 18 hours ago
₹ Not Disclosed

Infogain

A global digital engineering company delivering technology solutions that accelerate business outcomes. It specializes in cloud, data, AI, and experience-led transformation for enterprises across industries.