EXTRACTION OF WEB BLOCKS FROM WEB PAGES AND ANALYSIS OF EXTRACTION ALGORITHMS

Journal: International Journal of Scientific & Technology Research (Vol.3, No. 2)

Publication Date:

Authors : ; ;

Page : 169-178

Keywords : Fragment; ContentExtractor; DeSeA

Abstract

A web page can be divided into blocks called fragments. A fragment is a portion of a web page that has a distinct theme or functionality and is distinguishable from the other parts of the page. Dividing web pages into fragments provides significant benefits, so good methods for doing it are needed. Manual fragmentation of web pages is expensive, error-prone, and unscalable. Because of these problems, the ContentExtractor algorithm and the DeSeA algorithm have been widely used to extract web fragments. The proposed work has the following features:

1. Detect fragments using the ContentExtractor algorithm.
2. Extract the fragments detected in step 1.
3. Detect fragments using the DeSeA algorithm.
4. Extract the fragments detected in step 3.
5. Analyze the fragments extracted by the two algorithms.
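
To make the idea of a fragment concrete, the following is a minimal sketch of splitting a page into candidate blocks and comparing two sets of extracted blocks. It is not the ContentExtractor or DeSeA algorithm, whose details the abstract does not give. The tag set `BLOCK_TAGS`, the names `BlockSplitter`, `split_into_blocks`, and `jaccard`, and the use of Jaccard overlap as the comparison measure are all assumptions made for illustration.

```python
from html.parser import HTMLParser

# Assumption: these tags delimit candidate fragments. The abstract does not
# specify which tags the studied algorithms treat as block boundaries.
BLOCK_TAGS = {"div", "table", "tr", "td", "p", "ul", "ol",
              "section", "article", "nav", "header", "footer"}


class BlockSplitter(HTMLParser):
    """Collects the text found between block-level tag boundaries."""

    def __init__(self):
        super().__init__()
        self.blocks = []
        self._buf = []

    def flush(self):
        # Join buffered text, collapse whitespace, and keep non-empty blocks.
        text = " ".join("".join(self._buf).split())
        if text:
            self.blocks.append(text)
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self.flush()

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.flush()

    def handle_data(self, data):
        self._buf.append(data)


def split_into_blocks(html):
    """Return the text of each candidate block, in document order."""
    parser = BlockSplitter()
    parser.feed(html)
    parser.close()
    parser.flush()  # keep any text that follows the last block tag
    return parser.blocks


def jaccard(blocks_a, blocks_b):
    """Overlap between two sets of blocks (hypothetical comparison measure)."""
    a, b = set(blocks_a), set(blocks_b)
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)
```

For a page such as `<div>Menu <a href='#'>Home</a></div><div><p>Story text.</p></div><div>Footer</div>`, `split_into_blocks` yields one block for the menu, one for the story, and one for the footer. Passing the outputs of two different extractors to `jaccard` gives one simple way to quantify how much they agree; the analysis in the paper itself may use different criteria.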

Last modified: 2015-06-28 03:51:43