EXTRACTION OF WEB BLOCKS FROM WEB PAGES AND ANALYSIS OF EXTRACTION ALGORITHMS
Journal: International Journal of Scientific & Technology Research (Vol.3, No. 2)Publication Date: 2014-02-15
Authors : S.K.SHIRGAVE; V.B.BINAGE;
Page : 169-178
Keywords : Index Terms Fragment; ContentExtractor; DeSeA.;
Abstract
Abstract Web page can be divided in various blocks called as fragments. A fragment is a portion of a web page which has a distinct theme or functionality and is distinguishable from the other parts of the page.Dividing web pages into fragments has provided significant benefits. Good methods are needed for dividing web pages into fragments. Manual fragmentation of web pages is expensive error prone and un-scalable. Due to these problems extraction of web fragments using Content extractor algorithm and DeSeA algorithm have been widely used.The proposed work has following features 1Detect fragment using content extractor algorithm.2Extraction of fragment detected in step 1.3Detect fragment using DeSeA algorithm.4Extraction of fragment detected in step 3.5Analyze results of extracted fragment using above algorithms.
Other Latest Articles
- Synthesis Characterization And Electrical Conductivity Of Poly 2- ChloroanilineMMT And Poly 2-ChloroanilineNa-Bentonite Nano Composites In The Presence Of Surfactants
- Problem Solving Management Using Six Sigma Tools 26 Techniques
- A Case For Public Financing Of Broadband Internet Infrastructure In Ghana
- Semantic Similarity Measure Using Information Content Approach With Depth For Similarity Calculation
- Study The Relationship Between Emotional Intelligence Of The Managers And Their Entrepreneurial Personality In Air-Handling Units And Industrial Diffusers Manufacturers With Using Artificial Neural Network
Last modified: 2015-06-28 03:51:43