ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

SURVEY OF SIMILARITY JOIN ALGORITHMS BASED ON MAPREDUCE

Journal: MATTER: International Journal of Science and Technology (Vol.2, No. 1)

Publication Date:

Authors : ; ; ; ;

Page : 214-234

Keywords : Hadoop; MapReduce; Similarity Join;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Similarity Join is a data processing and analysis operation that retrieves all data pairs whose their distance is less than a pre-defined threshold. The similarity join algorithms are used in different real world applications such as finding similarity in documents, images, and strings. In this survey we will explain some of the similarity join algorithms which are based on MapReduce approach. These algorithms are: Set-Similarity Join, SSJ-2R, MRSimJoin, Pair-wise similarity, multi-sig-er method, Trie-join, and PreJoin algorithm. We then make a comparison between these algorithms according to some criteria and discuss the results.

Last modified: 2018-04-26 17:33:44