An Improved Clustering Algorithm for Text Mining: Multi-Cluster Spherical K-Means
Journal: The International Arab Journal of Information Technology (Vol.13, No. 1)Publication Date: 2016-01-01
Authors : Volkan Tunali; Turgay Bilgin; Ali Camurcu;
Page : 12-19
Keywords : Data mining; text mining; document clustering; SKM.;
Abstract
Thanks to advances in information and communication technologies, there is a prominent increase in the amount of information produced specifically in the form of text documents. In order to, effectively deal with this “information explosion” problem and utilize the huge amount of text databases, efficient and scalable tools and techniques are indispensable. In this study, text clustering which is one of the most important techniques of text mining that aims at extracting useful information by processing data in textual form is addressed. An improved variant of spherical K-Means (SKM) algorithm named multi-cluster SKM is developed for clustering high dimensional document collections with high performance and efficiency. Experiments were performed on several document data sets and it is shown that the new algorithm provides significant increase in clustering quality without causing considerable difference in CPU time usage when compared to SKM algorithm.
Other Latest Articles
- The Utilization of Education Management Information System (EMIS) to Predict Future Maintenance Plans for Buildings: A Case Study of Karak Governorate Schools
- 5G LTE-A Cognitive Multiclass Scheduling Scheme for Internet of Things
- VParC: A Compression Scheme for Numeric Data in Column-Oriented Databases
- The Impact of Human Resources Information Systems on Human Resources Selection and Recruitment Strategy: An applied study on Arab Potash Company in the Hashemite Kingdom of Jordan
- A Fuzzy Analytic Hierarchy Process for Security Risk Assessment of Web based Hospital Management System
Last modified: 2019-11-13 18:21:59