ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

IMPROVING PERFORMANCE OF DATA IN HADOOP CLUSTERS USING DYNAMIC DATA REPLICA PLACEMENT: A SURVEY

Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.7, No. 2)

Publication Date:

Authors : ;

Page : 153-156

Keywords : Apache Hadoop; HDFS; MapReduce; Data Replication Placement; MapReduce applications;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Big data refers to various forms of large information sets that require special computational platforms in order to be analyzed. Research on big data emerged in the 1970s but has seen an explosion of publications since 2008. The Apache Hadoop software library based framework gives permissions to distribute huge amount of datasets processing across clusters of computers using easy programmer models. In this paper, we discuss the architecture of Hadoop, survey paper of various data replication placement strategies and propose an approach for the improvement of data replica placement and suggest an implementation of proposed algorithm with various MapReduce applications for improving performance of data in Hadoop clusters with respect to execution time and number of nodes in Hadoop platform

Last modified: 2018-02-08 22:46:23