An Anti-Spam Filter Based on One-Class IB Method in Small Training Sets
Journal: The International Arab Journal of Information Technology (Vol.13, No. 6)Publication Date: 2016-11-01
Authors : Chen Yang; Shaofeng Zhao; Dan Zhang; Junxia Ma;
Page : 677-685
Keywords : IB method; one-class IB; anti-spam filter; Small training sets.;
Abstract
We present an approach to email filtering based on one -class Information Bottleneck (IB) method in small training sets. When themes of emails are changing continually, the available training set which is high -relevant to the current theme will be small. Hence, we further show how to estimate the learning algorithm and how to filter the spam in the small training sets. First, In order to preserve classification accuracy and avoid over -fitting while substantially reducing training set size, we consider the learning framework as the solution of one -class centroid only averaged by highly positive emails, and second, we design a simple binary classification model to filters spam by the comparison of similarity between emails and centroids. Experimental results show that in small training sets our method can significantly improve classification accuracy co mpared
with the currently popular methods, such as: Naive Bayes, AdaBoost and SVM
Other Latest Articles
- Metacognitive Awareness Assessment and Introductory Computer Programming Course Achievement at University
- Metacognitive Awareness Assessment and Introductory Computer Programming Course Achievement at University
- Metacognitive Awareness Assessment and Introductory Computer Programming Course Achievement at University
- Multiple-View Face Hallucination by a Novel Regression Analysis in Tensor Space
- The Refinement Check of Added Dynamic Diagrams Based on -Calculus
Last modified: 2019-11-14 16:46:56