VWDRE – A VISION-BASED APPROACH FOR MINING DATA FROM SEARCH ENGINE RESULT PAGES
Journal: International Journal of Civil Engineering and Technology (IJCIET) (Vol.8, No. 9)Publication Date: 2017-09-22
Authors : M. SATHYA S. MADHAN U. JOTHILAKSHMI;
Page : 973-982
Keywords : Vision-Based; Wrapper; Data Extraction; Search Engine Result Pages; DOM tree.;
Abstract
The data extraction from the dynamically generated web pages is a challenging factor because the result of the search engines are always different for every query submitted. Many techniques were proposed to address this issue but most of them have the common problem of language-dependency. In order to overcome the limitations of previous works, there are few ways which analyze visual features of the web page. In this paper, we proposed a new vision-based approach which is independent of the code used. It broadly utilizes the visual features on the search engine result pages to locate the data region so asto mine the data records from it. We develop a clustering by similarity algorithm to check the similarity of data records. Also, we propose a technique to generate the wrapper for data record extraction by examining the multiple result pages from the same search engine.
Other Latest Articles
- Xanthogranulomatous Pyelonephritis in association with Chronic kidney disease: A novel step in early detection of Renal Tuberculosis
- AUTOMATION OF PARCEL DELIVERY COLLECTION USING IOT – SMART FREIGHT BOX
- Retrospective study of primary extranodal abdominal lymphoma from a tertiary healthcare centre
- BIO-INSPIRED BUILT ENVIRONMENTS FOR CLIMATE CHANGE: DEVELOPING STRATEGIES FOR ADAPTATION AND MITIGATION
- SAFETY AND HEALTH ISSUES DURING PRINTING INK PRODUCTION PROCESS
Last modified: 2018-04-16 17:37:26