Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Azure Data Pipeline Developer - Databricks, Data Factory, Pyspark, Ml @ Infogain

Home > Software Development

 Azure Data Pipeline Developer - Databricks, Data Factory, Pyspark, Ml

Job Description


Work Location: Bengaluru, Pune, Mumbai, Noida, Gurugram.

Mode: Hybrid

Notice Period: 0 to 15 Days


Key Responsibilities:

Develop scalable data pipelines using Azure Data Factory (ADF), Databricks, PySpark, and Delta Lake to support ML and AI workloads.

Optimize and transform large datasets for feature engineering, model training, and real-time AI inference.

Build and maintain lakehouse architecture using Azure Data Lake Storage (ADLS) & Delta Lake.

Work closely with ML engineers & Data Scientists to deliver high-quality, structured data for training Generative AI models.

Implement MLOps best practices for continuous data processing, versioning, and model retraining workflows.

Monitor & improve data quality using Azure Data Quality Services.

Ensure cost-efficient data processing in Databricks using Photon, Delta Caching, and Auto-Scaling Clusters.

Secure data pipelines by implementing RBAC, encryption, and governance.


Required Skills & Experience:

6+ years of experience in Data Engineering with Azure & Databricks.

Proficiency in PySpark, SQL, and Delta Lake for large-scale data transformations.

Strong experience with Azure Data Factory (ADF), Azure Synapse, and Event Hubs.

Hands-on experience in building feature stores for ML models.

Experience with ML model deployment and MLOps pipelines (MLflow, Kubernetes, or Azure ML) is a plus.

Good understanding of Generative AI concepts and handling unstructured data.

Strong problem-solving, debugging, and performance optimization skills.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA,
Role Category: Software Development
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: Infogain
Location(s): Noida, Gurugram

+ View Contactajax loader


Keyskills:   ML model deployment Data Engineering Auto Scaling Machine Learning Operations Azure Machine Learning Azure Data Quality Services Photon Azure Data Lake Storage ML Ops pipelines Delta Caching Data Pipeline Delta Lake Azure Data Lake Data Lake

 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Application Developer

  • Accenture
  • 7 - 12 years
  • Bengaluru
  • 3 hours ago
₹ Not Disclosed

Senior Software Engineer_Embedded C Developer

  • Capgemini
  • 4 - 7 years
  • Chennai
  • 4 hours ago
₹ Not Disclosed

Software Engineer Iii - React / Ui

  • JPMorgan Chase Bank
  • 0 - 6 years
  • Bengaluru
  • 7 hours ago
₹ Not Disclosed

Senior Software Enginsenior Software Engineer-mbsdeer

  • Capgemini
  • 3 - 6 years
  • Pune
  • 8 hours ago
₹ Not Disclosed

Infogain

Infogain is a Silicon Valley headquartered company with software platform engineering and deep domain expertise in the travel, retail, insurance and high technology industries. We accelerate the delivery of digital customer engagement systems using digital technologies such as cloud, mic...