ResearchBib Share Your Research, Maximize Your Social Impacts
注册免费获得最新研究资源 注册 >> 登录

A comparison of most recent MapReduce joins algorithms

期刊名字: Multi-Knowledge Electronic Comprehensive Journal For Education And Science Publications (MECSJ) (Vol.2017, No. 2)

Publication Date:

论文作者 : ;

起始页码 : 87-121

关键字 : MapReduce; Hadoop; join types; multi-way join; theta-join; KNN join; top-k join; graph similarity join; semi join; filter join; bloom join; intersection join;

论文网址 : Download 您也可以查找论文通过 : Google Scholarexternal

论文摘要

In this interesting line of research, an attempt has been to overview different parallel processing platforms that implement MapReduce jobs. This survey provides a wide-ranging analysis of work and publications related to MapReduce framework to data, and it also can be used as a basis for further research and examination. The scope of this survey is focused on pre-processing, pre-filtering, partitioning, replication, load balancing, performance, memory space, communication cost, and query processing and optimization aspects in the light of big data analysis in MapReduce. Moreover, a set of efficient optimized and improved approaches in the context of analytical query processing and optimizing using MapReduce. It provides an added value to current research published yearly by introducing a comprehensive classification of recently presented papers in the era of join types using MapReduce. From data-centric perspective, the main topic of this approach is intended to highlight the importance of traditional problems of data management and analysis in the regard of efficient big data processing and analysis approaches.

更新日期: 2018-06-04 20:57:45