An Investigation on Topic Maps Based Document Classification with Unbalance Classes
Journal: Journal of Independent Studies and Research - Computing (Vol.13, No. 1)Publication Date: 2015-06-01
Authors : Maher Baloch Muhammad Rafi;
Page : 50-56
Keywords : ;
Abstract
Classification of imbalanced data has become a widespread problem due to the fact that the most real world datasets are imbalanced. In a classification task, one of the challenges is to learn the feature-space of classification under class-imbalance setting. The majority classes generally have good representation of features in the learned classification function and the minority classes lack this representation; subsequently, the classification for these classes failed more often. In this paper, authors investigate the task of document classification with topic map based representation of documents under class imbalance setting. In order to measure of topic-map based representation for classification under imbalance data, authors compare three representations: Bag-ofWords, Phrases and Topic terms for three approaches (i) under-sampling, (ii) cost-adjusting, and (iii) cluster based sampling. A series of experiments are carried out and results are reported.
Other Latest Articles
- Performance Analysis of Table Driven and Event Driven Protocols for Voice and Video Services in MANET
- Analysis of SSD Utilization by Graph Processing Systems
- Graph Visualization Tools: A Comparative Analysis
- Probabilistic Vs. Soft Computing for Classifying Credit Card Transactions. A Case Study of Pakistani's Credit Card Data
- Extracting patterns from Global Terrorist Dataset (GTD) Using Co-Clustering approach
Last modified: 2018-07-17 01:06:26