Location: Pune , Bangalore Must have Overall 5 to 9 yrs of IT experience Must have 5 to 7 years of hands-on Strong programming experience in Python/PySpark development on EMR (data curation) Pyspark scripting - extracting data from source system (AWS S3, RDS, AWS Redshift) Hands on experience in handling different types of files using python e.g. CSV, Parquet. JSON Must have experience with NOSQL and SQL (redshift preferred) queries. Should be good in SQL Analytical functions, experience participating in key business, architectural and technical decisions Must have expertise in extracting data from different source systems like flat files, API sources, Big data appliances, RDBMS, etc., using python.