Detailed JD:
- Very strong in Python and Unix / Linux Shell Scripting
- Should have worked on cloud platforms (preferably AWS: EC2, S3, CLI components)
- Knowledge of acquiring files from web portals/FTP/S3 (web-scraping exposure is a plus)
- Complex file handling: complex string editing, XML/JSON parsing (Linux/Python)
- Extract, load, and carry out other DB activities (using Linux/Python)
- Hands-on experience with the AWS CLI for S3 and SQS (Python Boto series)
- Experience in handling performance issues with very large files (>15 GB) using multi-threading and multiprocessing concepts in scripting (forking and other ways to run tasks in parallel in Linux, Python multi-threading and multiprocessing, etc.)
- Should be able to develop highly modularized code to ensure maximum code reusability (Linux shell functions; Python functions, classes, and objects, i.e. OOP)