Review on Text Clustering Based on Frequent Itemset

Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.3, No. 4)

Publication Date: 2014-04-30

Authors : Prajakta Jaswante; P.R. Deshmukh;

Page : 91-96

Keywords : Text mining; Text clustering; Text documents; Frequent itemsets; Apriori; Reuters-21578;

Abstract

The volume of textual information available in electronic form is growing at a staggering rate. This growth has motivated the task of mining useful or interesting frequent itemsets (words or terms) from very large text databases, a task that remains challenging. The use of such frequent itemsets for text clustering has received a great deal of attention in the research community, because the mined frequent itemsets drastically reduce the dimensionality of the documents. In the proposed research, we consider an efficient approach to text clustering based on frequent itemsets. The well-known Apriori algorithm is used to mine the frequent itemsets. The mined frequent itemsets are then used to obtain a partition in which the documents are initially clustered without overlap. The final clusters are then obtained by grouping the documents within each partition by means of derived keywords. For experimentation, any text dataset can be used, and the resulting outputs are expected to show that the proposed approach improves clustering performance.
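The first two stages described in the abstract (Apriori mining, then a non-overlapping initial partition) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy documents, the function names `apriori` and `partition`, the `min_support` value, and in particular the rule used to assign each document to a single itemset (longest itemset first, then highest support) are all assumptions made for illustration. The later regrouping by derived keywords is not covered here.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {frozenset(itemset): support count} for all frequent itemsets."""
    # Level 1: count individual terms across documents.
    counts = {}
    for t in transactions:
        for item in t:
            key = frozenset([item])
            counts[key] = counts.get(key, 0) + 1
    current = {s: c for s, c in counts.items() if c >= min_support}
    frequent = dict(current)

    k = 2
    while current:
        # Join step: unions of two frequent (k-1)-itemsets that have exactly k terms.
        candidates = set()
        for a, b in combinations(list(current), 2):
            union = a | b
            if len(union) != k:
                continue
            # Prune step: every (k-1)-subset of a candidate must itself be frequent.
            if all(frozenset(sub) in current for sub in combinations(union, k - 1)):
                candidates.add(union)
        # Count step: keep candidates that meet the minimum support.
        current = {}
        for cand in candidates:
            support = sum(1 for t in transactions if cand <= t)
            if support >= min_support:
                current[cand] = support
        frequent.update(current)
        k += 1
    return frequent


def partition(transactions, frequent):
    """Assign each document to exactly one frequent itemset it contains."""
    # Assumed rule (hypothetical): longest itemset first, then highest support,
    # then alphabetical order of terms so the result is deterministic.
    ranked = sorted(frequent, key=lambda s: (-len(s), -frequent[s], sorted(s)))
    clusters = {}
    for doc_id, terms in enumerate(transactions):
        label = next((s for s in ranked if s <= terms), None)  # None: no itemset matches
        clusters.setdefault(label, []).append(doc_id)
    return clusters


# Toy documents, already reduced to sets of terms (illustrative data only).
docs = [
    {"stock", "market", "trade"},
    {"stock", "market", "price"},
    {"oil", "price", "barrel"},
    {"oil", "barrel", "export"},
    {"wheat", "harvest"},
]

frequent = apriori(docs, min_support=2)
clusters = partition(docs, frequent)
for label, members in clusters.items():
    print(sorted(label) if label is not None else None, members)
```

Because each document is assigned to exactly one itemset, the initial clusters do not overlap, which is the property the abstract attributes to the partition step. Documents that contain no frequent itemset are collected under the `None` label in this sketch; how such documents are handled in the proposed approach is not stated in the abstract.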
