Presenting a Hybrid Feature Selection Method Using Chi2 and DMNB Wrapper for E-Mail Spam Filtering

Journal: International Journal of Computer Science and Network Solutions (IJCSNS) (Vol.1, No. 2)

Publication Date:

Authors : ;

Page : 16-28

Keywords : Feature Selection; Classification; Spam Filtering; Machine Learning

Abstract

The growing volume of spam email has created a need for more accurate and efficient email classification systems. This research presents a machine learning approach that improves the accuracy of automatic spam detection and filtering by separating spam from legitimate messages. To reduce the error rate and increase efficiency, a hybrid architecture for feature selection is used. The features are drawn from the body text of the messages. The proposed system combines two feature selection models, filter and wrapper, using a Chi-squared (Chi2) filter and a Discriminative Multinomial Naïve Bayes (DMNB) wrapper as feature selectors. For classification, the MNB, DMNB, SVM, and Random Forest classifiers are used. Finally, the results of these classifiers and feature selection methods are examined, the best design is selected, and it is compared with other similar works across different parameters. The best configuration of the proposed system reaches an accuracy of 99%.
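
The filter-then-wrapper idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation. It makes several assumptions: the toy corpus is invented, the function names (chi2_score, filter_select, wrapper_select) are hypothetical, a plain Multinomial Naive Bayes with Laplace smoothing stands in for DMNB, and the wrapper search is a single pass of greedy backward elimination scored by leave-one-out accuracy, since the abstract does not name a search strategy. The Chi2 score is the standard 2x2 contingency statistic over term presence and class.

```python
import math
from collections import Counter

# Toy corpus (invented for illustration): (tokens, label), 1 = spam, 0 = legitimate.
DOCS = [
    ("win free prize now".split(), 1),
    ("free cash offer now".split(), 1),
    ("claim free prize offer".split(), 1),
    ("meeting agenda for monday".split(), 0),
    ("project report for review".split(), 0),
    ("monday project meeting now".split(), 0),
]

def chi2_score(term, docs):
    """Chi-squared statistic of the 2x2 table (term present/absent) x (spam/legitimate)."""
    a = sum(1 for toks, y in docs if term in toks and y == 1)      # present, spam
    b = sum(1 for toks, y in docs if term in toks and y == 0)      # present, legitimate
    c = sum(1 for toks, y in docs if term not in toks and y == 1)  # absent, spam
    d = sum(1 for toks, y in docs if term not in toks and y == 0)  # absent, legitimate
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    return n * (a * d - b * c) ** 2 / denom if denom else 0.0

def filter_select(docs, k):
    """Filter stage: keep the k terms with the highest Chi2 score (ties broken alphabetically)."""
    vocab = {t for toks, _ in docs for t in toks}
    return sorted(vocab, key=lambda t: (-chi2_score(t, docs), t))[:k]

def mnb_predict(train, tokens, vocab):
    """Multinomial Naive Bayes with Laplace smoothing, restricted to `vocab` (stand-in for DMNB)."""
    best_label, best_logp = None, -math.inf
    for label in (0, 1):
        class_docs = [toks for toks, y in train if y == label]
        counts = Counter(t for toks in class_docs for t in toks if t in vocab)
        total = sum(counts.values())
        logp = math.log(len(class_docs) / len(train))  # class prior
        for t in tokens:
            if t in vocab:
                logp += math.log((counts[t] + 1) / (total + len(vocab)))
        if logp > best_logp:
            best_label, best_logp = label, logp
    return best_label

def loo_accuracy(docs, vocab):
    """Leave-one-out accuracy of the classifier when only `vocab` terms are used."""
    hits = 0
    for i, (toks, y) in enumerate(docs):
        train = docs[:i] + docs[i + 1:]
        hits += mnb_predict(train, toks, vocab) == y
    return hits / len(docs)

def wrapper_select(docs, candidates):
    """Wrapper stage: one pass of backward elimination, dropping a term if accuracy does not fall."""
    selected = list(candidates)
    best = loo_accuracy(docs, set(selected))
    for term in list(candidates):
        if len(selected) == 1:
            break
        trial = [t for t in selected if t != term]
        acc = loo_accuracy(docs, set(trial))
        if acc >= best:
            selected, best = trial, acc
    return selected, best

candidates = filter_select(DOCS, 6)                  # cheap statistical pre-selection
selected, accuracy = wrapper_select(DOCS, candidates)  # classifier-driven refinement
print("filter candidates:", candidates)
print("wrapper selection:", selected, "LOO accuracy:", accuracy)
```

The design point is the division of labour: the filter scores every term independently and cheaply, so it can shrink a large vocabulary quickly, while the wrapper retrains the classifier for each candidate subset and is therefore only affordable on the reduced set.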

Last modified: 2013-10-16 04:47:26