ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

SURVEY ON LOAD BALANCING AND DATA SKEW MITIGATION IN MAPREDUCE APPLICATIONS

Journal: International Journal of Computer Engineering and Technology (IJCET) (Vol.6, No. 1)

Publication Date:

Authors : ; ;

Page : 32-41

Keywords : Iaeme Publication; IAEME; Technology; Engineering; IJCET; Data Skew; Hadoop; Map Reduce; Partition Skew;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Since few years Map Reduce programming model have shown great success in processing huge amount of data. Map Reduce is a framework for data-intensive distributed computing of batch jobs. This data-intensive processing creates skew i n Map Reduce framework and degrades performance by great value. This leads to greatly varying execution time for the Map Reduce jobs. Due to this varying execution times results into lo w resource utilization and high overall execution time since new Map Reduce cycle can start only after all reducers are completed. During this paper, we are going to study various methodologies and techniques used to mitigate data skew and partition skew, and describe various advantages and limitations of these approaches.

Last modified: 2016-05-27 20:21:10