ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

An Investigation on Topic Maps Based Document Classification with Unbalance Classes

Journal: Journal of Independent Studies and Research - Computing (Vol.13, No. 1)

Publication Date:

Authors : ;

Page : 50-56

Keywords : ;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Classification of imbalanced data has become a widespread problem due to the fact that the most real world datasets are imbalanced. In a classification task, one of the challenges is to learn the feature-space of classification under class-imbalance setting. The majority classes generally have good representation of features in the learned classification function and the minority classes lack this representation; subsequently, the classification for these classes failed more often. In this paper, authors investigate the task of document classification with topic map based representation of documents under class imbalance setting. In order to measure of topic-map based representation for classification under imbalance data, authors compare three representations: Bag-ofWords, Phrases and Topic terms for three approaches (i) under-sampling, (ii) cost-adjusting, and (iii) cluster based sampling. A series of experiments are carried out and results are reported.

Last modified: 2018-07-17 01:06:26