ISSUES ON TRADITIONAL AND MODERN TEXTUAL DOCUMENT CLUSTERING ALGORITHMSJournal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.5, No. 11)
Publication Date: 2016-11-30
Authors : Wael Yafooz;
Page : 392-395
Keywords : textual document clustering; frequent - term; partitional clustering; hierarchical clustering .;
The amount of digital data utilized in daily life has increased owing to the high dependence on such data. Most data can be stored in textual documents. With the rapid increase in the number of textual documents, users face problems in obtaining useful information. Thus, a method by which to manage data is required to give users an idea about content. In addition, t echniques to increase the ratio of precision in information retrieval results are also needed. Therefore, the textual document clustering area is developed to represent the data in meaningful clusters. The two main factors encountered in the process of tex tual document clustering are efficiency and goodness or quality of data clusters. Efforts have been exerted to deal with these factors. These attempts can be categorized into either traditional or modern approaches. However, these attempts also face numero us issues. In this paper, we present the previous and current issues faced by textual document clustering algorithms to help text domain researchers understand these issues. This study provides researchers and students an overview about textual document cl ustering algorithms. Furthermore, this study can encourage researchers to find solutions to these issues.
Other Latest Articles
Last modified: 2016-11-18 19:38:36