This course aims to provide an overview of machine learning, data mining and statistical techniques that arise in data analytic applications. The course will cover the MapReduce programming framework; H2O Framework, Mahout, Apache Storm Framework, Spark framework. In this course, the student will learn: parallel algorithms for big data processing, massive Data Analytics, topic modeling, Time series analysis; Spatial time series analysis; Graph mining and Graph modeling