With a startup spirit and 90,000+ curious and courageous minds, we have the expertise to go deep with the world's biggest brands, and we have fun doing it. Now, we're calling all you rule-breakers and risk-takers who see the world differently, and are bold enough to reinvent it. Come, transform with us.
Are you the one we are looking for?
We are inviting applications for the role of AM, Data Engineer
In this role, you are expected to develop large-scale data structures and pipelines to organize, collect, and standardize data that helps generate insights and addresses reporting needs. You will write ETL (Extract / Transform / Load) processes, design database systems, and develop tools for real-time and offline analytic processing by integrating data from a variety of sources, ensuring that they adhere to data quality and accessibility standards. You will also propose ways to optimize the existing infrastructure, connect with business teams to understand and document requirements, and work on code reviews, production deployments, and the implementation of CI/CD pipelines.
Responsibilities
Design and develop large-scale data structures and pipelines to organize, collect, and standardize data that helps generate insights and addresses reporting needs.
Write ETL (Extract / Transform / Load) processes, design database systems, and develop tools for real-time and offline analytic processing by integrating data from a variety of sources, ensuring that they adhere to data quality and accessibility standards.
Use Python to build robust data pipelines and dynamic systems.
Collaborate with different client teams to develop and maintain long-term relationships with key stakeholders.
Engage and consult closely with all business teams to understand current and future needs.
Work with the onshore team to establish design patterns and development standards. Conduct code reviews and oversee unit testing.
Coordinate between onshore and offshore teams on project plans, code reviews, QA, and deployments.
Brainstorm with the development team on optimizing existing data flows, quality and performance tuning, and building proofs of concept.
Lead a team of data engineers.
Qualifications we seek in you
Minimum qualifications
Professional graduate/post-graduate degree in a Computer Science discipline
Rich work experience
A high degree of flexibility, versatility, and self-motivation is key
Willingness and desire to learn and grow
Preferred skills
Subject Matter Expert in Big Data and the Hadoop ecosystem.
Experience in designing and developing ETL processes.
Expertise in Hadoop, Hive, MapReduce, Sqoop, Oozie, Hue, HCatalog
Production experience with the Cloudera Hadoop distribution.
Experience with the AWS cloud platform.
Hands-on experience with DevOps tools (GitHub, Maven, Jenkins, Docker) and implementation of CI/CD pipelines
Hands-on experience with the Snowflake data warehouse
Expert-level programming experience, ideally in Python, PySpark, or Shell, and a willingness to learn new programming languages
Experience processing large amounts of structured and unstructured data, including integrating data from multiple sources.
Experience handling the entire software lifecycle: requirement gathering, project planning, and status reporting to various stakeholders
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values diversity and inclusion, respect and integrity, customer focus, and innovation. For more information, visit www.genpact.com. Follow us on Twitter, Facebook, LinkedIn, and YouTube.
Genpact (NYSE: G) is a global professional services firm focused on delivering digital transformation for our clients, putting digital and data to work to create competitive advantage. We do this by integrating lean principles, design thinking, analytics and digital technologies with our domain and ...