An Improved Hierarchical Technique for Document Clustering
Journal: International Journal of Science and Research (IJSR) (Vol.4, No. 4)Publication Date: 2015-04-05
Authors : Priti B. Kudal; Manisha Naoghare;
Page : 1983-1986
Keywords : Data Mining; Clustering; Classification; Similarity Measure; Term Frequency;
Abstract
Data mining is the process of non-trivial discovery from implied, previously unknown, and potentially useful information from data in large databases. Hence it is a core element in knowledge discovery, often used synonymously. Clustering, one of technique for data mining used for grouping similar terms together. Earlier statistical analysis used in text mining depends on term frequency. Then, new concept based text mining model was introduced which analyses terms. Clustering of document is useful for the purpose of document organization, summarization, and information retrieval in an efficient way. Initially, clustering is applied for enhancing the information retrieval techniques. Of late, clustering techniques have been applied in the areas which involve browsing the gathered data or in categorizing the outcome provided by the search engines for the reply to the query raised by the users. In this paper, we are providing a comprehensive survey over the document clustering.
Other Latest Articles
- Morphometric Estimation of Cephalic Index in North Indian Population: Craniometrics Study
- State of Charge Estimation for LiFePO4 Cells
- Comparative Analysis of Hybrid Intrusion Detection System and Intrusion Prevention System for MANET
- The Role of House Flies (Musca domestica) as a Vector for Parasitic Pathogens in Al-Diwaniya Province / Iraq
- Design and Analysis of Composite Drive Shaft
Last modified: 2021-06-30 21:44:39