IMPROVING PERFORMANCE OF DATA IN HADOOP CLUSTERS USING DYNAMIC DATA REPLICA PLACEMENT: A SURVEY
Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.7, No. 2)Publication Date: 2018-02-28
Authors : S. Annapoorani B. Srinivasan;
Page : 153-156
Keywords : Apache Hadoop; HDFS; MapReduce; Data Replication Placement; MapReduce applications;
Abstract
Big data refers to various forms of large information sets that require special computational platforms in order to be analyzed. Research on big data emerged in the 1970s but has seen an explosion of publications since 2008. The Apache Hadoop software library based framework gives permissions to distribute huge amount of datasets processing across clusters of computers using easy programmer models. In this paper, we discuss the architecture of Hadoop, survey paper of various data replication placement strategies and propose an approach for the improvement of data replica placement and suggest an implementation of proposed algorithm with various MapReduce applications for improving performance of data in Hadoop clusters with respect to execution time and number of nodes in Hadoop platform
Other Latest Articles
- PROJECT RISK ANALYSIS FOR INFRASTRUCTURE PROJECT USING SIMULATION TECHNIQUE
- WIRELESS NOTICE BOARD BASED ON ARDUINO AND GSM TECHNOLOGY
- A STUDY ON USE OF LOGISTICS MANAGEMENT BY COURIER SERVICES IN INDIA
- AN OVERVIEW OF CONVENTIONAL FLEXO PLATE MAKING & DIGITAL PLATE MAKING PROCESS
- AN OVERVIEW OF SHEET-FED OFFSET PRESSES FOR OPTIMUM CONSUMPTION OF PRINTING SUBSTRATE
Last modified: 2018-02-08 22:46:23