A text similarity measure for document classification
Journal: IADIS INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (Vol.12, No. 1)Publication Date: 2017-07-01
Authors : Gali Suresh Reddy; T. V. Rajinikanth;
Page : 14-25
Keywords : Feature Selection; Feature Reduction; Clustering; Classification; Dimensionality;
Abstract
Dimensionality reduction is very challenging and important in text mining. We need to know which features be retained what to be and It helps in reducing the processing overhead when performing text classification and text clustering. Another concern in text clustering and text classification is the
similarity measure which we choose to find the similarity degree between any two text documents. In this paper, we work towards text clustering and text classification by addressing the use of the proposed similarity measure which is an improved version of our previous measures. This proposed measure is used for supervised and un-supervised learning. The proposed measure overcomes the disadvantages of the existing measures.
Other Latest Articles
- A new paradigm for information systems projects management based on a knowledge management approach
- EDUCATIONAL MARKETING STRATEGIES ON THE MARKET OF HIGHER EDUCATION SERVICES
- REGIONAL EDUCATIONAL EQUITY: A SURVEY ON THE ABILITY TO DESIGN SCIENTIFIC EXPERIMENTS OF SIXTH-GRADE STUDENTS
- DEVELOPMENT OF ESTONIAN UPPER SECONDARY SCHOOL STUDENTS’ BIOLOGICAL CONCEPTUAL UNDERSTANDING AND COMPETENCES
Last modified: 2019-12-13 20:45:43