WEB LINK SPAM IDENTIFICATION INSPIRED BY ARTIFICIAL IMMUNE SYSTEM AND THE IMPACT OF TPP-FCA FEATURE SELECTION ON SPAM CLASSIFICATION
Journal: ICTACT Journal on Soft Computing (IJSC) (Vol. 4, No. 1)
Publication Date: 2013-10-01
Authors : S. K. Jayanthi; S. Sasikala;
Page : 633-644
Keywords : Web Spam; Search Engine; TPP; FCA; AIRS;
Abstract
Search engines are the gateway for retrieving information from the web. Web spam is a manipulative technique for inflating the ranking and visibility of web pages in search engine results. This paper addresses link spam classification using features of web sites: link-related features extracted from each site are used to discriminate spam from non-spam sites. Artificial immune systems (AIS) are machine learning systems inspired by the principles of natural immunology. They comprise supervised learning schemes that can be adapted to a wide range of classification problems. AIS-inspired algorithms are applied to the dataset and the results are evaluated. The UK-WEBSPAM-2007 dataset [8] is used for the experiments, and WEKA [9] is used to run the classifiers. The Artificial Immune Recognition System (AIRS) algorithms appear to perform better than the others evaluated. The best classification accuracy attained is 98.89%, by the AIRS1 algorithm, which compares favorably with classifier accuracies reported in the existing literature.
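As an illustration of the immune-inspired classification idea described in the abstract, the following is a minimal sketch, not the paper's AIRS1 implementation and not the WEKA classifier. It assumes a heavily simplified scheme: a training sample is kept as a "memory cell" only if no existing cell of the same class is already close to it, and new samples are labeled by a majority vote among the nearest memory cells. The clonal expansion, mutation, and resource competition steps of real AIRS are omitted. The two-valued feature vectors, the threshold, and all function names are hypothetical and chosen only for the example.

```python
import math
from collections import Counter


def distance(a, b):
    """Euclidean distance; a smaller distance means a higher affinity."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def train_memory_cells(samples, labels, threshold):
    """Build a pool of (vector, label) memory cells.

    Simplifying assumption: a sample becomes a new memory cell only when no
    existing cell of the same class lies within `threshold` of it. Real AIRS
    also clones, mutates, and prunes candidate cells; that is omitted here.
    """
    cells = []
    for vec, label in zip(samples, labels):
        same_class = [c for c, l in cells if l == label]
        if not same_class or min(distance(vec, c) for c in same_class) > threshold:
            cells.append((vec, label))
    return cells


def classify(cells, vec, k=3):
    """Majority vote among the k memory cells closest to `vec`."""
    nearest = sorted(cells, key=lambda cell: distance(vec, cell[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Made-up link features per site (hypothetical, not from UK-WEBSPAM-2007).
samples = [(0.9, 0.8), (0.85, 0.9), (0.1, 0.2), (0.15, 0.1), (0.88, 0.82)]
labels = ["spam", "spam", "nonspam", "nonspam", "spam"]

cells = train_memory_cells(samples, labels, threshold=0.05)
prediction = classify(cells, (0.8, 0.85), k=3)
```

In this toy run the last training sample lies close to an existing spam cell, so it is absorbed rather than stored, which mirrors (very loosely) how a memory cell pool compresses the training data before the nearest-neighbor vote.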
Last modified: 2013-12-05 19:55:42