ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Learning spam features using restricted boltzmann machines

Journal: IADIS INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (Vol.11, No. 1)

Publication Date:

Authors : ; ; ; ; ;

Page : 99-114

Keywords : Spam Detection; Machine Learning; Restricted Boltzmann Machines; Optimum-Path Forest.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Nowadays, spam detection has been one of the foremost machine learning-oriented applications in the context of security in computer networks. In this work, we propose to learn intrinsic properties of e-mail messages by means of Restricted Boltzmann Machines (RBMs) in order to identity whether such messages contain relevant (ham) or non-relevant (spam) content. The main idea contribution of this work is to employ Harmony Search-based optimization techniques to fine-tune RBM parameters, as well as to evaluate their robustness in the context spam detection. The unsupervised learned features are then used to feed the Optimum-Path Forest classifier, being the original features extracted from e-mail content and compared against the new ones. The results have shown RBMs are suitable to learn features from e-mail data, since they obtained favorable results in the datasets considered in this work.

Last modified: 2016-12-21 21:49:09