Informative Content Extraction By Using EIFCE (Effective Informative Content Extractor)
Journal: International Journal of Scientific & Technology Research (Vol.2, No. 6)
Publication Date: 2013-06-25
Authors : Chaw Su Win; Mie Mie Su Thwin;
Page : 136-144
Keywords : Informative Content Extraction; Main Content Extraction; Web Page Segmentation;
Abstract: Web pages contain many items that are not informative content, e.g. search and filtering panels, navigation links, advertisements, and so on. Most clients and end users look for the informative content and largely ignore the rest, so the need for Informative Content Extraction from web pages is evident. Web Informative Content Extraction requires two steps: Web Page Segmentation and Informative Content Extraction. DOM-based segmentation approaches often fail to provide satisfactory results, and vision-based segmentation approaches also have drawbacks. This paper therefore proposes the Effective Visual Block Extractor (EVBE) algorithm to overcome the problems of DOM-based approaches and reduce the drawbacks of previous work in Web Page Segmentation. It also proposes the Effective Informative Content Extractor (EIFCE) algorithm to reduce the drawbacks of previous work in Web Informative Content Extraction. Web page indexing systems, web page classification and clustering systems, and web information extraction systems can achieve significant savings and satisfactory results by applying the proposed algorithms.
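The two-step pipeline named in the abstract (segment the page, then keep the informative blocks) can be illustrated with a minimal Python sketch. It is not the paper's EVBE or EIFCE algorithm, whose details the abstract does not give. As assumptions, it uses a naive tag-based segmenter (the kind of DOM-based approach the abstract says is often unsatisfactory) and a generic link-density heuristic as a stand-in for the extraction step. The names `BlockSegmenter`, `extract_informative`, `BLOCK_TAGS`, and both thresholds are hypothetical.

```python
from html.parser import HTMLParser

# Hypothetical choice: tags treated as block boundaries in this sketch.
BLOCK_TAGS = {"div", "p", "td", "li", "ul", "section", "article", "nav"}


class BlockSegmenter(HTMLParser):
    """Step 1 (naive stand-in, not EVBE): split a page into text blocks."""

    def __init__(self):
        super().__init__()
        self.blocks = []       # finished blocks: (text_chars, link_chars, text)
        self._text = []        # text pieces of the block being built
        self._link_chars = 0   # characters of the current block inside <a>
        self._in_link = 0      # nesting depth of open <a> tags

    def flush(self):
        # Close the current block, collapsing runs of whitespace.
        text = " ".join(" ".join(self._text).split())
        if text:
            self.blocks.append((len(text), self._link_chars, text))
        self._text, self._link_chars = [], 0

    def handle_starttag(self, tag, attrs):
        if tag in BLOCK_TAGS:
            self.flush()
        elif tag == "a":
            self._in_link += 1

    def handle_endtag(self, tag):
        if tag in BLOCK_TAGS:
            self.flush()
        elif tag == "a" and self._in_link:
            self._in_link -= 1

    def handle_data(self, data):
        self._text.append(data)
        if self._in_link:
            self._link_chars += len(" ".join(data.split()))


def extract_informative(html, max_link_density=0.3, min_chars=40):
    """Step 2 (generic heuristic, not EIFCE): keep long, link-poor blocks."""
    seg = BlockSegmenter()
    seg.feed(html)
    seg.close()
    seg.flush()  # pick up any trailing text after the last block tag
    return [text for n, link, text in seg.blocks
            if n >= min_chars and link / n <= max_link_density]


PAGE = """
<div><ul><li><a href="/">Home</a></li><li><a href="/news">News</a></li></ul></div>
<div><p>Web pages mix the main article text with navigation links, search panels and advertisements that readers rarely want.</p></div>
<div><a href="/ad">Buy now and save on everything in our store today only</a></div>
"""

if __name__ == "__main__":
    for block in extract_informative(PAGE):
        print(block)
```

In this toy page the navigation items are too short and the advertisement consists entirely of link text, so only the paragraph block is kept. A heuristic like this ignores visual layout altogether, which is the kind of gap that vision-based segmentation approaches aim to close.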
Last modified: 2013-08-10 23:36:18