SPARK BASED DISTRIBUTED FREQUENT ITEMSET MINING TECHNIQUE FOR BIG DATA
Journal: International Journal of Advanced Research in Engineering and Technology (IJARET) (Vol.11, No. 10)Publication Date: 2020-10-31
Authors : Thirumaran. S R. Nagarajan;
Page : 1800-1814
Keywords : Big data; Data mining; Distributed frequent itemset mining; Frequent itemset mining; Parallel FP-growth; Spark.;
Abstract
In the phase of association rule mining, frequent itemset mining is an important move. Conventional methods to mine frequent itemsets in the big data epoch pose major obstacles as there is insufficient processing capacity and memory space. This work presents a novel frequent itemset mining technique that utilizes both matrix-based pruning method and Spark. The presented Spark based distributed (SBD) frequent itemset mining technique can efficiently reduce the number of campaign itemsets and also boost the performance of iterative computing. The experimental analysis is carried out using typical benchmark datasets to compare the presented SBD algorithm with the previous algorithm such as parallel FP-growth and distributed frequent itemset mining. The numerical results obtained from the experimental analysis indicate that SBD algorithm has better scalability and efficiency. Furthermore, a case study is accomplished to verify the efficacy of the presented SBD frequent itemset mining technique.
Other Latest Articles
- Anti-Müllerian Hormone Related to Reproductive and Productive Longevity in Egyptian Buffaloes
- DOCUMENT CLUSTERING USING STATISTICAL INTEGRATED GRAPH BASED CO-WORD INTERPRETATION APPROACH
- INNOVATIONMANAGEMENT USINGTHE RADARTOOL: AN AUDIT RESULTFROM DUBAIELECTRICITYANDWATER AUTHORITY (DEWA)IN U.A.E
- A COMPARATIVE STUDY ON THE PERFORMANCE ANALYSIS OF SOFTWARE DEVELOPMENT COST MODEL WITH EXPONENTIAL FAMILY DISTRIBUTION PROPERTY
- PERFORMANCE ASSESSMENT OF RECYCLED AGGREGATE CONCRETE SHORT COLUMNS SUBJECTED TO ECCENTRIC COMPRESSION LOAD
Last modified: 2021-04-19 22:26:28