Lead Data Engineer - Python/Spark @ Fission Labs

Home > Software Development

Lead Data Engineer - Python/Spark

Fission Labs
5 - 8 years
Hyderabad
11 months ago
Email to a friend
Report this job

Job Description

Role Overview :

A Senior/Lead Data Engineer is responsible for overseeing the design, development, and management of data infrastructure and pipelines within an organisation. This role involves a mix of technical leadership, project management, and collaboration with other teams to ensure the efficient collection, storage, processing, and analysis of large datasets. The Lead Data Engineer typically manages a team of data engineers, architects, and analysts, ensuring that data workflows are scalable, reliable, and meet the business's requirements.Responsibilities :

- Lead the design, development, and maintenance of data pipelines and ETL processes architect and implement scalable data solutions using Databricks and AWS.

- Optimize data storage and retrieval systems using Rockset, Clickhouse, and CrateDB.- Develop and maintain data APIs using FastAPI.- Orchestrate and automate data workflows using Airflow.- Collaborate with data scientists and analysts to support their data needs.- Ensure data quality, security, and compliance across all data systems.- Mentor junior data engineers and promote best practices in data engineering.- Evaluate and implement new data technologies to improve the data infrastructure.- Participate in cross-functional projects and provide technical leadership.- Manage and optimize data storage solutions using AWS S3, implementing best practices for data lakes and data warehouses.- Implement and manage Databricks Unity Catalog for centralized data governance and access control across the organization.Qualifications :

- Bachelor's or Master's degree in Computer Science, Engineering, or related field- 5+ years of experience in data engineering, with at least 3 years in a lead role- Strong proficiency in Python, PySpark, and SQL- Extensive experience with Databricks and AWS cloud services- Hands-on experience with Airflow for workflow orchestration- Familiarity with FastAPI for building high-performance APIs- Experience with columnar databases like Rockset, Clickhouse, and CrateDB- Solid understanding of data modeling, data warehousing, and ETL processes- Experience with version control systems (e.g., Git) and CI/CD pipelines- Excellent problem-solving skills and ability to work in a fast-paced environment- Strong communication skills and ability to work effectively in cross-functional teams- Knowledge of data governance, security, and compliance best practices- Proficiency in designing and implementing data lake architectures using AWS S3- Experience with Databricks Unity Catalog or similar data governance and metadata management tools

Preferred Qualifications :

- Experience with real-time data processing and streaming technologies- Familiarity with machine learning workflows and MLOps- Certifications in Databricks, AWS- Experience implementing data mesh or data fabric architectures- Knowledge of data lineage and metadata management best practicesTech Stack :

- Databricks, Python, PySpark, SQL, Airflow, FastAPI, AWS (S3, IAM, ECR, Lambda), Rockset, Clickhouse, CrateDB

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: Fission Labs
Location(s): Hyderabad

+ View Contact

Login

Candidates can login here to view contacts and apply.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach Resume Max 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Candidates are expected to provide most recent and accurate profile information, inappropriate content is strictly prohibited!

Keyskills: Data Engineering DataLake Data Pipeline PySpark Azure Databricks Data Warehousing Data Modeling ETL Data Governance Python SQL

Job seems aged, it may have been expired!
Fraud Alert to job seekers!

₹ Not Disclosed

Job application

We will notify the employer with your details. You can also attach a resume or a cover letter.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach ResumeMax 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Similar positions

Site Reliability Engineer

Capgemini

4 - 8 years

Mumbai

1 day ago

₹ Not Disclosed

.Net Core - Engineering Manager / Technical Manager - Hyderabad

Wipro

10 - 20 years

Hyderabad

2 days ago

₹ Not Disclosed

Application Lead

Accenture

5 - 10 years

Kolkata

2 days ago

₹ Not Disclosed

Sr. Software Engineer B2

Cognizant

6 - 10 years

Bengaluru

2 days ago

₹ Not Disclosed

Fission Labs

Fission Labs is a Software Product Development & Services company delivering high-end solutions primarily in the areas of highly scalable cloud applications and analytics for large sets of data. Few of the challenges we are working on are scaling an application to support 150 million users, real-tim...

Lead Data Engineer - Python/Spark @ Fission Labs

Home > Software Development