ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A text similarity measure for document classification

Journal: IADIS INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (Vol.12, No. 1)

Publication Date:

Authors : ; ;

Page : 14-25

Keywords : Feature Selection; Feature Reduction; Clustering; Classification; Dimensionality;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Dimensionality reduction is very challenging and important in text mining. We need to know which features be retained what to be and It helps in reducing the processing overhead when performing text classification and text clustering. Another concern in text clustering and text classification is the similarity measure which we choose to find the similarity degree between any two text documents. In this paper, we work towards text clustering and text classification by addressing the use of the proposed similarity measure which is an improved version of our previous measures. This proposed measure is used for supervised and un-supervised learning. The proposed measure overcomes the disadvantages of the existing measures.

Last modified: 2019-12-13 20:45:43