Desired Candidate Profile
Collection, processing and storing of huge amount of online retail data into the data lake
Designing, implementing and administering complex big data processing solutions
Data ingestion and processing using MapR DB under MapR Converged Data Platform and Spark/Scala
Extraction of various formats of data: relational & structured to semi structured, unstructured formats
Designing and implementing big data solutions for batch and near real time streaming of data using streaming technologies like Kafka/MapR Streams, Streamsets In MapR Converged Data Platform
Creating knowledge repository for the implementations and solutions designed to handle large volume of structured and unstructured data
Code migration and deployment across multiple environments using source code and configuration management tools
Skills needed:
Strong knowledge and hands on experience on MapR converged data platform, certification on this is an added advantage
Strong exposure and work experience in ingesting huge volume of data into data lake, preferably hive and MapRDB is a must have
Good exposure and hands on MapR DB
Work experience on Spark/Scala along with streaming exposure
Good hands on streaming technologies like Map R Streams, kafka
A very good understanding and working knowledge of RDBMS. Oracle-EBS is a value add
Strong work experience in Apache Sqoop in ingesting relational data into data lake
Good exposure and understanding of source code management/migration tools like GitHub, bit bucket, etc.
Strong experience in moving/migrating code bases across multiple environments
Preferred certified Map R .
Contact Details: