ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A review on Joining Approaches in Hadoop Framework and Skewness Associate to it

Journal: International Journal of Engineering and Techniques (Vol.2, No. 6)

Publication Date:

Authors : ;

Page : 166-170

Keywords : Hadoop; Mapreduce; Vertical scaling; Joining approaches; Data skewness.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Today�s era is generally treated as the era of data on each and every field of computing application huge amount of data is generated. The society is gradually more dependent on computers so large amount of data is generated in each and every second which is either in structured format, unstructured format or semi structured format. These huge amount of data are generally treated as big data. To analyze big data is a biggest challenge in current world. Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage and it generally follows horizontal processing. Map Reduce programming is generally run over Hadoop Framework and process the large amount of structured and unstructured data. This Paper describes about different joining strategies used in Map reduce programming to combine the data of two files in Hadoop Framework and also discusses the skewness problem associate to it.

Last modified: 2018-05-18 21:46:49