A Bi-Level Text Classification Approach for SMS Spam Filtering and Identifying Priority Messages
Journal: The International Arab Journal of Information Technology (Vol.14, No. 4)Publication Date: 2017-07-01
Authors : Naresh Kumar Nagwani;
Page : 473-480
Keywords : SMS spam; priority sms; important sms; sms spam filtering; bi-level binary classification.;
Abstract
Short Message Service (SMS) traffic is increasing day by day and trillions of sms are sent and received by billions of users every day. SPAM messages are also increasing in same proportionate. Numbers of recent advancements are taking place in the field of sms spam detection and filtering. The objective of this work is twofold, first is to identify and classify spam messages from the collection of sms messages and second is to identify the priority or important sms messages from the filtered non-spam messages. The objective of the work is to categorize the sms messages for effective management and handling of sms messages. the work is planned in two level of binary classification wherein at the first level of classification the sms messages are categorized into the two classes spam and non-spam using popular binary classifiers, and then at the second level of classification non-spam sms messages are further categorized into the priority and normal sms messages. four state of the art popular text classification techniques namely, naïve bayes, support vector machine, latent dirichlet allocation and non-negative matrix factorization are used to categorize the sms text message at different levels of classification. The proposed bi-level classification model is also evaluated using the performance measures accuracy and f-measure. Combinations of classifiers at both levels are compared and it is shown from the experiments that support vector machine algorithm performs better for filtering the spam messages and categorizing the priority messages.
Other Latest Articles
- Muzzle Classification Using Neural Networks
- Image Compression based on Iteration-Free Fractal and using Fuzzy Clustering on DCT Coefficients
- A New (k, n) Secret Image Sharing Scheme (SISS)
- A New Way of Accelerating Web by Compressing Data with Back Reference-Prefer Geflochtener
- An Unsupervised Feed Forward Neural Network Method for Efficient Clustering
Last modified: 2019-05-08 20:17:08