Novel Web Data Extraction Using Template Extraction and Filtering Non Information

Journal: International Journal of Science and Research (IJSR) (Vol.4, No. 12)

Publication Date:

Authors : ; ;

Page : 2102-2105

Keywords : Information Filtering; Non Information; Template Extraction; Unsupervised learning; Web data extraction;

Abstract

The web is a huge repository of information that holds different types of data in various forms, yet usually only the relevant data needs to be extracted from it. Web data extractors are used to automatically extract data from web documents. Various scientific tools, with a broad range of applications, are used to study the problems related to web data extraction. In the proposed system, data is extracted using template extraction. Template matching is based on depth and data similarity, and the non-information part of the web pages is removed by filtering. The proposed system works on input documents of variable depth.
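
The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that pages generated from the same template share text at the same tag path (the path length standing in for depth) and that exact text equality is an acceptable stand-in for data similarity. The names `PathTextParser`, `text_nodes`, and `extract_data` are hypothetical and chosen for illustration.

```python
from collections import Counter
from html.parser import HTMLParser

# Tags that never get a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"br", "img", "hr", "meta", "link", "input"}


class PathTextParser(HTMLParser):
    """Collect (tag path, text) pairs; the path length is the node depth."""

    def __init__(self):
        super().__init__()
        self.stack = []   # currently open tags, outermost first
        self.nodes = []   # (path, text) pairs in document order

    def handle_starttag(self, tag, attrs):
        if tag not in VOID_TAGS:
            self.stack.append(tag)

    def handle_endtag(self, tag):
        # Pop back to the matching open tag, tolerating unclosed children.
        if tag in self.stack:
            while self.stack and self.stack.pop() != tag:
                pass

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.nodes.append(("/".join(self.stack), text))


def text_nodes(html):
    """Return the (path, text) pairs of one HTML document."""
    parser = PathTextParser()
    parser.feed(html)
    parser.close()
    return parser.nodes


def extract_data(pages):
    """For each page, keep only the text nodes not shared by every page.

    Assumption: a node with the same path and the same text on all pages
    belongs to the template (non-information) and is filtered out.
    """
    parsed = [text_nodes(page) for page in pages]
    counts = Counter()
    for nodes in parsed:
        counts.update(set(nodes))
    template = {node for node, n in counts.items() if n == len(pages)}
    return [[node for node in nodes if node not in template] for nodes in parsed]


pages = [
    "<html><body><div><h1>Shop</h1><p>Red lamp</p>"
    "<span>Copyright</span></div></body></html>",
    "<html><body><div><h1>Shop</h1><p>Blue chair</p>"
    "<span>Copyright</span></div></body></html>",
]
for data in extract_data(pages):
    print(data)
```

The sketch needs at least two pages from the same site, since with a single page every node would count as template. The pages may differ in nesting depth, because nodes are compared by their full tag path rather than by position.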

Last modified: 2021-07-01 14:28:06