ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

An Efficient Algorithm for Extracting Infrequent Itemsets from Weblog

Journal: The International Arab Journal of Information Technology (Vol.16, No. 2)

Publication Date:

Authors : ;

Page : 275-280

Keywords : Infrequent itemsets; lattice; frequent itemsets; weblog; supportthreshold;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Weblog data contains unstructured information. Due to this, extracting frequent pattern fromweblog databases is a very challenging task. A power set lattice strategy is adopted for handling that kind of problem. In this lattice, the top label contains full set and at the bottom label contains empty set. Most number of algorithms follows bottom-up strategy, i.e. combining smaller to larger sets. Efficient lattice traversal techniques are presented which quickly identify all the long frequent itemsets and their subsets if required. This strategy is suitable for discovering frequent itemsets but it might not be worth being used for infrequent itemsets. In this paper, we propose Infrequent Itemset Mining for Weblog (IIMW) algorithm; it is a top-down breadth-first level-wise algorithm for discovering infrequent itemsets. We have compared our algorithm IIMW to Apriori-Rare, Apriori-Inverse and generated result in with different parameters such as candidate itemset, frequent itemset, time, transaction database and support threshold.

Last modified: 2019-04-28 19:19:41