A Parallel AprioriAll-Based Sequential Pattern Mining Algorithm on MapReduce
Proceeding: The International Conference on E-Technologies and Business on the Web (EBW)Publication Date: 2013-05-07
Authors : Pianwittayasakun Patcharee; Zhu Hongming; Yang Xiaowen; Zhou You;
Page : 68-73
Keywords : Sequential pattern mining; Concept lattice; Distributed systems; Hadoop;
Abstract
AprioriAll is a well-known algorithm for sequential pattern mining, an important and very useful for identifying patterns that can be used for predicting behaviors and future trends to answer business questions. However, AprioriAll algorithm has some limitations concerning sequential pattern mining of huge datasets where the technique suffers in performance and scalability. There are some attempts to handle this problem in distributed way, And we as well in this paper proposes a parallel sequential pattern mining algorithm called PAA (Parallel AprioriAll). PAA is based on a AprioriAll algorithm with modifications to make use of a Concept Lattice model in order to run it in a distributed fashion using Hadoop and MapReduce. We have developed a prototype implementing the PAA algorithm and made a comparative study with stand-alone AprioriAll algorithm. The empirical comparison results show that PAA algorithm is correct, produce the same result and outperform the AprioriAll algorithm.
Other Latest Articles
- A Web-Based Evaluation Framework for Supporting Novice and Expert Evaluators of Adaptive E-Learning Systems
- Research and Implementation of Secure Authentication in E-Commerce System Based on PKI and USB Key
- Integrated Architecture for Web Application Development Based on Spring Framework and Activiti Engine
- A Study on Internet Financial Reporting of Private University in Taiwan and USA: A Perspective from Financial Transparency
- The Effectiveness of Technology-Mediation on Learning Second Foreign Language - A Case Study of Vocational College Students
Last modified: 2013-08-30 22:36:47