Desired Candidate Profile
Responsibilities:
-Implementation of Big Data infrastructure and technologies and data flow processes including data extraction and ETL processes to populate data lake.
-Develop a cloud-native and hybrid cloud solutions in an enterprise environment.
-Work with data scientist(s) to ensure availability of needed data sets.
-Monitor performance and be accountable to make necessary infrastructure adjustments.
-Work with our security team to establish and enforce data retention and data security policies.
-Work with various stakeholders to establish a big data technology roadmap across the enterprise.
-In collaboration with various stakeholders, gain an understanding of data sources and recommend areas for process optimization.
Must-Have Skills:
-Strong software engineering fundamentals in either Java or C/C++.
-Experience with designing and developping applications across public & private cloud (AWS/Azure/Google Cloud)
-Proficient understanding of distributed computing and the Hadoop technology ecosystem.
-Experienced with data query/ingestion tools such as Hive, Sqoop, Flume, Kafka, Spark, and Storm.
-Expert understanding of architecture principles - SOA, SOAP, REST API, microservice
-Experience with APIs and Container Ecosystems (Kubernetes, Cloud Foundry, Apigee, Mesos, Docker, Swarm, etc.)
-Unit-testing with JUnit, TestNG or Hadoop testing tools.
-Experienced with NoSQL databases such as HBase, MongoDB, Cassandra, PostgreSQL.
-Proven experience in dealing with complex, unclean data with techniques around wrangling/munging.
-Strong communication skills with the ability to facilitate open discussions, highlight issues and develop a broadly acceptable consensus amongst diverse stakeholders.
-Post-secondary degree in Computer Science, Engineering or related discipline.
-Experience with Python (Scikit-Learn, Numpy, Pandas), Tensorflow.
-Visualization experience with Tableau or Qlik.
Certifications:
-Cloud Developer or Architect certification for AWS or Google Cloud or Azure
-Hadoop certification on Cloudera or Hortonworks
Contact Details: