EXTRACTION OF WEB BLOCKS FROM WEB PAGES AND ANALYSIS OF EXTRACTION ALGORITHMSJournal: International Journal of Scientific & Technology Research (Vol.3, No. 2)
Publication Date: 2014-02-15
Authors : S.K.SHIRGAVE; V.B.BINAGE;
Page : 169-178
Keywords : Index Terms Fragment; ContentExtractor; DeSeA.;
Abstract Web page can be divided in various blocks called as fragments. A fragment is a portion of a web page which has a distinct theme or functionality and is distinguishable from the other parts of the page.Dividing web pages into fragments has provided significant benefits. Good methods are needed for dividing web pages into fragments. Manual fragmentation of web pages is expensive error prone and un-scalable. Due to these problems extraction of web fragments using Content extractor algorithm and DeSeA algorithm have been widely used.The proposed work has following features 1Detect fragment using content extractor algorithm.2Extraction of fragment detected in step 1.3Detect fragment using DeSeA algorithm.4Extraction of fragment detected in step 3.5Analyze results of extracted fragment using above algorithms.
Other Latest Articles
Last modified: 2015-06-28 03:51:43