A Survey On Various Web Template Detection And Extraction Methods
Journal: International Journal of Scientific & Technology Research (Vol.4, No. 3)Publication Date: 2015-03-15
Authors : Neethu Mary Varghese; Tenny Thomas Soman;
Page : 41-44
Keywords : Index Terms Cluster; Homogeneous web page; Heterogeneous web page; Page-level detection; Search engine; Site-level detection; Template Detection; Template Extraction.;
Abstract
Abstract In todays digital world reliance on the World Wide Web as a source of information is extensive. Users increasingly rely on web based search engines to provide accurate search results on a wide range of topics that interest them. The search engines in turn parse the vast repository of web pages searching for relevant information. However majority of web portals are designed using web templates which are designed to provide consistent look and feel to end users. The presence of these templates however can influence search results leading to inaccurate results being delivered to the users. Therefore to improve the accuracy and reliability of search results identification and removal of web templates from the actual content is essential. A wide range of approaches are commonly employed to achieve this and this paper focuses on the study of the various approaches of template detection and extraction that can be applied across homogenous as well as heterogeneous web pages.
Other Latest Articles
- Effect Of LLDPE Addition On The Reduction Of Feo From EAF Slags
- Image Reconstruction Using Pixel Wise Support Vector Machine SVM Classification.
- Image Reconstruction Using Multi Layer Perceptron MLP And Support Vector Machine SVM Classifier And Study Of Classification Accuracy
- URL Mining Using Agglomerative Clustering Algorithm
- The Social Development Dimension Of The Nursing Profession In Managing HIV Cases
Last modified: 2015-06-28 04:09:11