A review on Joining Approaches in Hadoop Framework and Skewness Associate to it
Journal: International Journal of Engineering and Techniques (Vol.2, No. 6)Publication Date: 2016-11-01
Authors : Bibhudutta Jena;
Page : 166-170
Keywords : Hadoop; Mapreduce; Vertical scaling; Joining approaches; Data skewness.;
Abstract
Today�s era is generally treated as the era of data on each and every field of computing application huge amount of data is generated. The society is gradually more dependent on computers so large amount of data is generated in each and every second which is either in structured format, unstructured format or semi structured format. These huge amount of data are generally treated as big data. To analyze big data is a biggest challenge in current world. Hadoop is an open-source framework that allows to store and process big data in a distributed environment across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage and it generally follows horizontal processing. Map Reduce programming is generally run over Hadoop Framework and process the large amount of structured and unstructured data. This Paper describes about different joining strategies used in Map reduce programming to combine the data of two files in Hadoop Framework and also discusses the skewness problem associate to it.
Other Latest Articles
Last modified: 2018-05-18 21:46:49