ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Grouping and Categorization of Documents in Relativity Measure?

Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.2, No. 3)

Publication Date:

Authors : ;

Page : 89-94

Keywords : Document Clustering; Correlation Latent Semantic Indexing; Dimensionality Reduction; Correlation Measure;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

This paper presents a spectral clustering method called correlation through preserving indexing (CPI), which is to perform in the correlation similarity measure space. The documents are considered into a low dimensional semantic space, the correlations between the documents in the local patches are maximized and correlations between the documents outside these patches are minimized. The intrinsic structure of the document space is included in the similarities between the documents. Correlation is the similarity measure for finding the intrinsic structure of the document space than Euclidean distance. Simultaneously, the proposed CPI methods can effectively finding the intrinsic structures included in high-dimensional document space. The effectiveness of the new method is implemented by extensive experiments conducted on various data sets and by comparison with existing document clustering methods.

Last modified: 2013-04-20 19:55:56