Your browser does not support javascript! Please enable it, otherwise web will not work for you.

Bigdata(Pyspark) Engineer @ Impetus Technologies

Home > Software Development

 Bigdata(Pyspark) Engineer

Job Description

Skills: Bigdata,Pyspark,Python ,Hadoop / HDFS; Spark;
Good to have : GCP
Roles/Responsibilities:
Develops and maintains scalable data pipelines to support continuing increases in data volume and complexity.
Collaborates with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
Writes unit/integration tests, contributes to engineering wiki, and documents work.
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
Works closely with a team of frontend and backend engineers, product managers, and analysts.
Defines company data assets (data models), spark, sparkSQL, and hiveSQL jobs to populate data models.
Designs data integrations and data quality framework.
Basic Qualifications:
BS or MS degree in Computer Science or a related technical field
4+ years of SQL experience (No-SQL experience is a plus)
4+ years of experience with schema design and dimensional data modelling
4+ years of experience with Big Data Technologies like Spark, Hive
2+ years of experience on data engineering on Google Cloud platform services like big query.
Skills: Bigdata,Pyspark,Python ,Hadoop / HDFS; Spark;
Good to have : GCP
Roles/Responsibilities:
Develops and maintains scalable data pipelines to support continuing increases in data volume and complexity.
Collaborates with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
Writes unit/integration tests, contributes to engineering wiki, and documents work.
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
Works closely with a team of frontend and backend engineers, product managers, and analysts.
Defines company data assets (data models), spark, sparkSQL, and hiveSQL jobs to populate data models.
Designs data integrations and data quality framework.
Basic Qualifications:
BS or MS degree in Computer Science or a related technical field
4+ years of SQL experience (No-SQL experience is a plus)
4+ years of experience with schema design and dimensional data modelling
4+ years of experience with Big Data Technologies like Spark, Hive
2+ years of experience on data engineering on Google Cloud platform services like big query.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: Software Development
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: Impetus Technologies
Location(s): Chennai

+ View Contactajax loader


Keyskills:   Data analysis Backend GCP Schema Data quality Business intelligence Analytics SQL Python

 Job seems aged, it may have been expired!
 Fraud Alert to job seekers!

₹ Not Disclosed

Similar positions

Software Engineer

  • Orange Business
  • 1 - 5 years
  • Noida, Gurugram
  • 2 days ago
₹ Not Disclosed

Software Engineer

  • Orange Business
  • 1 - 5 years
  • Noida, Gurugram
  • 2 days ago
₹ Not Disclosed

Software Engineer

  • Orange Business
  • 1 - 5 years
  • Noida, Gurugram
  • 2 days ago
₹ Not Disclosed

Software Engineer

  • Orange Business
  • 1 - 5 years
  • Noida, Gurugram
  • 2 days ago
₹ Not Disclosed

Impetus Technologies

Impetus Technologies Impetus Technologies is a software products and services company focused on creating powerful and intelligent enterprises through deep data awareness, data integration and advanced data analytics. Our products and services are designed to empower the real-time data driven en...