Role Proficiency:
This role requires proficiency in data pipeline development, including coding and testing pipelines that ingest, wrangle, transform, and join data from various sources. Must be skilled in ETL tools such as Informatica, Glue, Databricks, and DataProc, with coding expertise in Python, PySpark, and SQL. Works independently and has a deep understanding of data warehousing solutions including Snowflake, BigQuery, Lakehouse, and Delta Lake. Capable of calculating costs and understanding performance issues related to data solutions.
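As an illustration of the pipeline work described above, the following is a minimal PySpark sketch that ingests, wrangles, transforms, and joins data; it assumes a Databricks/Spark environment, and the table and column names (raw.orders, raw.customers, order_ts) are hypothetical placeholders, not part of the role definition.

    # Minimal sketch only; source and target tables are placeholders.
    from pyspark.sql import SparkSession, functions as F

    spark = SparkSession.builder.appName("orders_pipeline").getOrCreate()

    orders = spark.read.table("raw.orders")          # ingest
    customers = spark.read.table("raw.customers")

    cleaned = (
        orders
        .dropna(subset=["order_id", "customer_id"])         # wrangle
        .withColumn("order_date", F.to_date("order_ts"))    # transform
    )

    enriched = cleaned.join(customers, on="customer_id", how="left")  # join

    enriched.write.mode("overwrite").saveAsTable("curated.orders_enriched")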
Outcomes:
Measures of Outcomes:
Outputs Expected:
Code Development:
Documentation:
Configuration:
Testing:
Domain Relevance:
Project Management:
Defect Management:
Estimation:
Knowledge Management:
Release Management:
Design Contribution:
Customer Interface:
Team Management:
Certifications:
Skill Examples:
Knowledge Examples:
Additional Comments:
Role(s): Data Engineer
Role Location(s): India
Planned Start Date: 11/3/2025
Role Scope / Deliverables: Create new data pipelines in Databricks. Support existing data pipelines in Databricks. Create DAG setups in Airflow. Resolve day-to-day job failures and performance issues.
Key Skills: Minimum 4+ years of total experience. Proficient in Databricks and PySpark/Python (minimum 3+ years of experience). Proficient in SQL (minimum 3+ years of experience). Hands-on experience in cloud platforms AWS/Azure (preferably AWS). Analytical experience is good to have. Excellent communication. Should be able to work independently.
Databricks, PySpark, SQL, Python
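As an illustration of the Airflow DAG setup mentioned in the role scope above, below is a minimal sketch that triggers an existing Databricks job. It assumes Airflow 2.4+ with the Databricks provider installed; the DAG id, schedule, connection id, and job id are hypothetical placeholders.

    # Minimal sketch; job_id, connection id, and schedule are placeholders.
    from datetime import datetime

    from airflow import DAG
    from airflow.providers.databricks.operators.databricks import DatabricksRunNowOperator

    with DAG(
        dag_id="databricks_daily_pipeline",
        start_date=datetime(2025, 1, 1),
        schedule="0 2 * * *",   # daily at 02:00
        catchup=False,
    ) as dag:
        run_pipeline = DatabricksRunNowOperator(
            task_id="run_orders_pipeline",
            databricks_conn_id="databricks_default",  # Databricks workspace connection
            job_id=12345,                             # existing Databricks job (placeholder)
        )

Day-to-day failure and performance triage in this setup would typically combine Airflow task retries and alerting with the Databricks job run logs.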

Keyskills: Glue, PySpark, data warehousing, data pipelines, SQL, analytics, Apache, ETL tools, GCP, AWS/Azure, design, BigQuery, ETL programming, communication skills, Snowflake, Python development, Airflow, Talend, Microsoft Azure, DataProc, NoSQL, Databricks, AWS Glue, optimization techniques, AWS, Informatica