BUILDING HIGHLY SPECIALISED WEB CRAWLER USING B - TREE SEARCH AND HTML PARSER
Journal: International Journal of Engineering Sciences & Management Research (Vol. 3, No. 5)
Publication Date: 2016-05-30
Authors: Prateek Raman; Ravi Kant Gautam; Ravi Yadav; Manish Kumar Sharma
Pages: 98-102
Keywords: Best First Search; Priority Strategy of Web Grasping; B-tree Algorithm; Web Revisiting strategy Recommendation System
Abstract
Web crawlers are Internet bots that automatically traverse the hyperlink structure of the World Wide Web in order to locate and retrieve information. This paper describes a web crawling approach based on B-tree search and an HTML parser. The goal of the crawler is to selectively seek out pages that are relevant to given keywords. Rather than collecting and indexing all available web documents in order to answer all possible queries, the crawler analyzes its crawl boundary to find the links that are likely to be most relevant to the crawl, and avoids the irrelevant links in each document.
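
The selective strategy described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The abstract gives no details of the B-tree structure, so a binary heap from Python's standard library stands in for the ordered crawl frontier. The relevance measure (keyword counts in anchor text), the names LinkParser, score and focused_crawl, and the in-memory PAGES dictionary are all assumptions made for illustration.

```python
import heapq
from html.parser import HTMLParser


class LinkParser(HTMLParser):
    """Collects (href, anchor text) pairs from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []      # finished (href, anchor_text) pairs
        self._href = None    # href of the <a> tag currently open, if any
        self._text = []      # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text)))
            self._href = None


def score(text, keywords):
    """Hypothetical relevance measure: number of keyword occurrences."""
    lowered = text.lower()
    return sum(lowered.count(k.lower()) for k in keywords)


def focused_crawl(pages, seed, keywords, limit=10):
    """Best-first crawl over `pages`, a dict mapping URL -> HTML.

    Returns the visited URLs in visiting order. Links whose anchor
    text scores zero are treated as irrelevant and never queued.
    """
    # Assumption: a min-heap of (negated score, url) replaces the B-tree.
    frontier = [(0, seed)]
    seen = {seed}
    visited = []
    while frontier and len(visited) < limit:
        _, url = heapq.heappop(frontier)   # most relevant link first
        html = pages.get(url)
        if html is None:
            continue
        visited.append(url)
        parser = LinkParser()
        parser.feed(html)
        for href, anchor in parser.links:
            s = score(anchor, keywords)
            if s > 0 and href not in seen:
                seen.add(href)
                heapq.heappush(frontier, (-s, href))
    return visited


# Toy in-memory "web" so the sketch runs without network access.
PAGES = {
    "a": '<a href="b">web crawler basics</a> <a href="c">cooking tips</a>'
         '<a href="d">crawler crawler frontier</a>',
    "b": '<a href="e">B-tree crawler index</a>',
    "c": "",
    "d": "",
    "e": "",
}

print(focused_crawl(PAGES, "a", ["crawler"]))
```

With the toy data, page "d" is visited before "b" because its anchor text mentions the keyword twice, and page "c" is never fetched because its anchor text does not mention the keyword at all. A real crawler would also need URL normalisation, politeness delays and a revisiting policy, none of which are sketched here.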