Deep Web Mining Using C# Wrappers
Journal: International Journal of Science and Research (IJSR) (Vol.5, No. 9)Publication Date: 2016-09-05
Authors : Rakesh Kumar Baloda; Praveen Kantha;
Page : 527-531
Keywords : Deep Web; Web Mining; Information Extraction; Wrappers; Crawling;
Abstract
World Wide Web (Internet) has immense collection of information that can be extracted for building knowledge base and business intelligence purposes. Generally that valuable information lies deep inside web databases and is not accessible directly through surface web crawling methods. This information can only be accessed via a focused crawler or wrapper program customized for a particular website. The wrapper can submit a set of values for form fields and imitate user actions such as mouse click or link navigations as performed on a web browser, thus saving the response page received from a web server and can then after extract information such as table data, links, image URLs etc after parsing the DOM structure of the document. We propose a C# crawler that can crawl a basic website and a set of related procedures (wrapper) which can extract (or mine) data from that resource by making use of regular expressions (Regex) patterns.
Other Latest Articles
- Electric Field Distribution of Wire-Duct Electrostatic Precipitator using FDM and MATLAP
- Barriers to the Implementation of Supply Chain Management- Case of Small to Medium Sized Contractors in Turkey
- Synthesis of Silver Nano Particles Using Plectranthus Ambonicus and Its Antimicrobial Activity on Polypropylene Non Woven Surgical Mask
- Study on Group Size and Group Composition of Great Indian one Horned Rhinoceros (R.Unicornis,Linn.) at Gorumara, Jaldapara and Kaziranga National Parks, India
- In-Vitro Antibacterial Activity of Leaf and Stem Extract of Passiflora edulis (Passion Fruit) Planted in Federal University of Agriculture Makurdi, Central Nigeria
Last modified: 2021-07-01 14:44:11