Lead end-to-end data and ETL testing efforts for large-scale data platforms and pipelines
Develop and maintain automated test frameworks and test scripts using Python for data validation and quality assurance
Perform data integrity, reconciliation, and regression testing across multiple systems and environments
Validate ETL workflows, mappings, and transformations in tools such as Informatica, Talend, DataStage, or PySpark-based pipelines
Work closely with data engineering teams to understand business logic, data flows, and transformation rules
Design and implement data quality metrics, monitoring dashboards, and issue tracking processes
Conduct root cause analysis for data anomalies and collaborate with development teams for resolution
Drive continuous improvement in testing strategy, automation coverage, and process efficiency
Mentor and guide junior QA engineers in testing methodologies, scripting, and automation frameworks
Required qualifications to be successful in this role:
Bachelor's/Master's degree in Computer Science, Information Technology, or a related field
8+ years of experience in data/ETL testing, with at least 2 years in a lead role
Strong expertise in Python scripting for test automation and data validation
Hands-on experience with ETL tools such as Informatica, Talend, DataStage, or custom Spark/SQL-based ETL
Strong knowledge of SQL for complex queries, joins, aggregations, and validation
Experience testing data warehouses, data lakes, and big data ecosystems (Cloudera, AWS, Azure, GCP)
Familiarity with PySpark, Airflow, or other data orchestration tools is a plus
Exposure to CI/CD pipelines and tooling (Jenkins, Git, Docker) for automated test execution
Good understanding of data governance, data lineage, and data quality frameworks
Excellent analytical, debugging, and communication skills
Preferred Skills:
Experience with API testing and data exchange formats (JSON, XML)
Familiarity with cloud-based data services (AWS Glue, Redshift, BigQuery, Snowflake)
Knowledge of Agile/Scrum methodologies
Experience with pytest, unittest, or other Python-based test frameworks
Skills:
Informatica
Oracle
Shell Script
Job Classification
Industry: IT Services & Consulting
Functional Area / Department: Engineering - Software & QA
Role Category: DBA / Data warehousing
Role: Data warehouse Architect / Consultant
Employment Type: Full time