Lead Analyst - Lead Bigdata Developer - Python, Pyspark & Sql @ CGI

Job Description

Job Summary: CGI is looking for a Lead Big Data Developer who will design, build, and optimize large-scale data pipelines and processing systems. The ideal candidate is a hands-on technologist with deep expertise in Python, PySpark, and SQL, capable of leading data initiatives, mentoring team members, and ensuring delivery of high-performance data solutions across the organization.


Your future duties and responsibilities:
  • Lead the design and development of scalable, efficient, and reliable data pipelines using PySpark, Python, and SQL
  • Collaborate with data architects, analysts, and business stakeholders to understand data requirements and translate them into technical solutions
  • Optimize data workflows for performance, scalability, and cost efficiency in big data environments (e.g., Databricks, EMR, GCP Dataproc, or similar)
  • Implement data ingestion, transformation, and aggregation processes from multiple structured and unstructured sources
  • Ensure data quality, integrity, and consistency through validation, testing, and monitoring frameworks
  • Work with cloud-based data platforms (AWS, Azure, or GCP) and leverage tools like S3, Delta Lake, or Snowflake
  • Design and enforce best practices for coding, version control, and CI/CD within the data engineering team
  • Provide technical leadership and mentorship to junior and mid-level developers
  • Collaborate with DevOps and DataOps teams for deployment and operationalization of data solutions
  • Stay updated with the latest technologies and trends in the big data ecosystem
Required qualifications to be successful in this role:
Required Skills & Experience:
  • 8+ years of experience in data engineering or big data development, including at least 3 years in a lead or senior role
  • Strong proficiency in Python for data processing, scripting, and automation
  • Advanced hands-on experience with PySpark (RDD, DataFrame, and Spark SQL APIs)
  • Deep expertise in SQL (query optimization, analytical functions, performance tuning)
  • Strong understanding of distributed data processing and data lake architectures
  • Experience working with the Hadoop ecosystem (Hive, HDFS, Spark, Kafka, etc.)
  • Hands-on experience with cloud platforms (AWS, Azure, or GCP) and data orchestration tools (Airflow, ADF, etc.)
  • Solid understanding of data modeling, ETL design, and performance optimization
  • Experience with version control (Git) and CI/CD pipelines for data projects
  • Excellent communication and leadership skills, with the ability to guide cross-functional teams
Preferred Qualifications:
  • Experience with Delta Lake / Apache Iceberg / Hudi
  • Knowledge of containerization and orchestration (Docker, Kubernetes)
  • Exposure to machine learning pipelines or data science integration
  • Certification in AWS Big Data / GCP Data Engineer / Azure Data Engineer is a plus
Education:
  • Bachelor's or master's degree in Computer Science, Information Technology, or a related field
Skills:
  • English
  • Python
  • SQL
  • Analytical Thinking

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Platform Engineer
Employment Type: Full time

Contact Details:

Company: CGI
Location(s): Hyderabad


Keyskills: Big Data, Hive, PySpark, SQL, Docker, Git, Data Science, Spark, GCP, DevOps, Hadoop, ETL, Azure, S3, Snowflake, Python, Data Engineer, Airflow, Databricks, Machine Learning, Data Quality, Query Optimization, Kafka, AWS, Kubernetes

₹ Not Disclosed

Similar positions

Hiring For AXIOM developer resources in Mumbai

  • Clover Infotech
  • 4 - 7 years
  • Mumbai
  • 14 hours ago
₹ 5-15 Lacs P.A.

Freelance - Part-time - Python Developer

  • TJL Dynamics
  • 2 - 6 years
  • Chennai
  • 15 hours ago
₹ 96,000-1.2 Lacs P.A.

Senior .Net Developer

  • Hyperworks Imaging
  • 10 - 12 years
  • Bengaluru
  • 15 hours ago
₹ 27.5-30 Lacs P.A.

Oracle Brm Developer

  • Teliolabs
  • 3 - 7 years
  • India
  • 15 hours ago
₹ Not Disclosed
