A Comparative Study on Apache Spark and Map Reduce with Performance Analysis Using KNN and Page Rank Algorithm
Journal: International Journal of Trend in Scientific Research and Development (Vol.2, No. 4)Publication Date: 2018-08-01
Authors : Himanshu Suhas Mone Shilpa Deshmukh;
Page : 2391-2396
Keywords : Big data; Hadoop; Map Reduce; Spark; Mahout; MLib; Machine Learning; KNN; Page Rank;
Abstract
With the unremitting advancement of internet, IT and enhancement of technology, tremendous growth of data has been observed. Data is getting generated at very tremendous speed, referred to as Big Data. Big Data has gained more prominence in recent times with continuous explosion of data resulting from various sources. The major focus of this paper is to compare performance between Hadoop and Spark on iterative and machine learning algorithm. Hadoop and Spark both are processing model for analysing big data and their performance varies significantly based on the use case under implementation. In this paper, we compare these two frameworks along with providing the performance analysis using a standard machine learning algorithm for classification (knn) and Page Rank algorithm. Himanshu Suhas Mone | Shilpa Deshmukh"A Comparative Study on Apache Spark and Map Reduce with Performance Analysis Using KNN and Page Rank Algorithm" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-2 | Issue-4 , June 2018, URL: http://www.ijtsrd.com/papers/ijtsrd15629.pdf http://www.ijtsrd.com/computer-science/other/15629/a-comparative-study-on-apache-spark-and-map-reduce-with-performance-analysis-using-knn-and-page-rank-algorithm/himanshu-suhas-mone
Other Latest Articles
Last modified: 2018-08-02 16:11:33