Desired Candidate Profile
Description
This position will be responsible for becoming an expert on a number of different data sets. On a day to day basis this will require doing research and analysis on the data to find ways to improve quality and deliver new features, summarizing this analysis, working with Software Engineering for implementation in the product, and working with the broader QA team so we have ongoing reported metrics to track.
Responsibilities/Key Tasks
Job Functions:
Build a holistic understanding of our data assets, its customer, the BI data infrastructure & environment, and business goals
Analyze large data sets to gain an understanding of the data, discover data anomalies, and look for ways to leverage data in support of organizational goals and initiatives.
Use big data technologies to build predictive systems on big data platforms (Hadoop Ecosystem)
Analyze and validate data using statistical tools to answer product and business questions
Use data to create models that depict trends in diverse data sets
Perform data manipulations such as data imports, wrangling, exports and updates
Assist with data mining, API development and connectivity with 3rd party data vendors
Ability to explain statistical approaches/models used, rationale/merits of different approaches, results and interpretation to team members
Experience building data pipelines
Experience working with large data sets 10TB++
Qualifications/Educational Requirements
3+ year experience with Big Data, Hadoop, NoSQL databases especially SQL, Hive, Spark
Strong data interpretation, visualization and presentation skills
Self-motivated, agile, excellent collaborative skills and ability to influence diverse audiences
Coding ability in a language of your choice (some wed be excited about: Python, R, Java, Scala, etc.) and a desire to learn more!
Must have desire to work in a team environment yet be self-directed, proactive, and action-oriented.
Basic knowledge of DNS, Internet protocols, VPN, web servers experience preferred.
Comfort with statistical and machine learning techniques
5-7+ years relevant experience; engineering/math College degree required, advanced degree desired
Education:
UG: Any Graduate - Any Specialization
PG: Any Postgraduate - Any Specialization
Doctorate: Any Doctorate - Any Specialization
Contact Details:
Keyskills:
Java
Machine Learning
R
Python
SQL
Data Mining
Spark
Hadoop
Research Analysis
SCALA