A text similarity measure for document classification

Journal: IADIS INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (Vol.12, No. 1)

Publication Date: 2017-07-01

Authors : Gali Suresh Reddy; T. V. Rajinikanth;

Page : 14-25

Keywords : Feature Selection; Feature Reduction; Clustering; Classification; Dimensionality;

Abstract

Dimensionality reduction is both challenging and important in text mining. We need to know which features to retain and which to discard, since this reduces the processing overhead of text classification and text clustering. Another concern in both tasks is the choice of similarity measure used to compute the degree of similarity between any two text documents. In this paper, we address text clustering and text classification using a proposed similarity measure, which is an improved version of our previous measures. The proposed measure is used for both supervised and unsupervised learning, and it overcomes the disadvantages of the existing measures.
