About the job
Experience: 3-6 years

Develop and plan required analytic projects in response to business needs.
Partner with Product and Engineering teams to solve problems and identify trends and opportunities.
Process, cleanse, and verify the integrity of data used for analysis.
Apply expertise in quantitative analysis, data mining, and the presentation of data to see beyond the numbers and understand how our users interact with both our consumer and business products.
Design and evaluate experiments.
Mine data using state-of-the-art methods.
Select features, and build and optimize classifiers using machine learning techniques.
Enhance data collection procedures to include information that is relevant for building analytic systems.
Extend the company's data with third-party sources of information when needed.
Carry out ad-hoc analysis and present results in a clear manner.
Build and fine-tune analytical answers.
Visualise the data with charts and graphs.
Assist in building and analysing dashboards and reports.
Contribute to data mining architectures, modelling standards, reporting, and data analysis methodologies.
Work with application developers to extract data relevant for analysis.
Propose what to build in the next roadmap.
Understand ecosystems, user behaviours, and long-term trends.
Identify new levers to help move key metrics.
An attitude that ensures safe and secure operations.
A security- and privacy-first approach to everything.
Ability to lead initiatives and people toward common goals.
Good analytical and interpersonal communication skills.
Able to write and communicate effectively.
Motivated to work in a start-up environment.

Requirements
3 years of overall experience in Data Science.
Excellent understanding of machine learning techniques and algorithms, such as k-NN, Naive Bayes, SVM, Decision Forests, and Neural Networks.
Experience with common data science toolkits, such as Python, NumPy, and Pandas.
Experience with data visualisation tools, such as D3.js and ggplot.
Experience automating analysis and authoring pipelines via SQL, NoSQL, and Python-based ETL frameworks.
Proficiency in query languages such as SQL, Hive, and Pig.
Experience with NoSQL databases, such as MongoDB, Cassandra, and HBase.
Good applied statistics skills, such as distributions, statistical testing, and regression.
Knowledge of Java and Big Data technologies such as Spark, Storm, or Flink will be a plus.
Desired Skills and Experience
hive, algorithms, authoring, data mining, sql, pig, d3.js, security, java, big data, mongodb, hbase, python