ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Enhancing the Hadoop Performance through Data Placement in Heterogeneous Hadoop Cluster

Journal: International Journal of Science and Research (IJSR) (Vol.4, No. 4)

Publication Date:

Authors : ; ;

Page : 3150-3153

Keywords : Big Data; Hadoop; Heterogeneous Cluster; Data Placement;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

In the present world large volumes of data are getting generated and these records and data details have to be maintained for future purpose. Keeping these large bulks of data and using them becomes difficult. To overcome this and make it easy to store, use and work with it a tool called Hadoop is used. Hadoop uses the concept of a cluster that is many small nodes together form a cluster. Nodes with varying configurations (like varying RAM sizes, processors) form a heterogeneous cluster. Data placement technique in heterogeneous cluster is complicated. The data placement technique in heterogeneous cluster helps in the efficient use of resources and when combined with the MapReduce programming model increases the performance. Data placement can be done by forming racks. In this work we enhance the performance of Hadoop in heterogeneous cluster by first creating racks for data placement and then modifying certain parameters of Hadoop tool. The techniques are implemented and evaluated in Hadoop 1.0.3.

Last modified: 2021-06-30 21:44:39