
Key skills: Scala, Java, Spark, Hadoop, Python, Big Data Administration, Hive, Cloudera, PySpark, Data Warehousing, Apache Pig, Business Intelligence, SQL, Apache Flume, MySQL, Big Data, ETL, HBase, Oozie, Impala, Data Engineering, NoSQL, MapReduce, Kafka, Sqoop, AWS, YARN, Unix