Learning spam features using restricted boltzmann machines
Journal: IADIS INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS (Vol.11, No. 1)Publication Date: 2016-03-15
Authors : Luis Alexandre da Silva; Kelton Augusto Pontara da Costa; Patricia Bellin Ribeiro; Gustavo Henrique de Rosa; João Paulo Papa;
Page : 99-114
Keywords : Spam Detection; Machine Learning; Restricted Boltzmann Machines; Optimum-Path Forest.;
Abstract
Nowadays, spam detection has been one of the foremost machine learning-oriented applications in the context of security in computer networks. In this work, we propose to learn intrinsic properties of e-mail messages by means of Restricted Boltzmann Machines (RBMs) in order to identity whether such messages contain relevant (ham) or non-relevant (spam) content. The main idea contribution of this work is to employ Harmony Search-based optimization techniques to fine-tune RBM parameters, as well as to evaluate their robustness in the context spam detection. The unsupervised learned features are then used to feed the Optimum-Path Forest classifier, being the original features extracted from e-mail content and compared against the new ones. The results have shown RBMs are suitable to learn features from e-mail data, since they obtained favorable results in the datasets considered in this work.
Other Latest Articles
- User feedback session with clicked and unclicked documents for related search recommendation
- Usage analysis in sap erp-systems for measurement of business maturity ? exemplified in purchasing
- Between immersion and emersion: orientating digital games towards virtual and physical spaces
- Secure Health Statistical Analysis Methods
- C-Warehousing: A HL7 CDA-based approach for the secondary use of clinical data
Last modified: 2016-12-21 21:49:09