An Elegant Fusion of Concurrent Crawling and Page Rank Technique for Spidering Websites
Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.3, No. 6)Publication Date: 2014-06-30
Authors : Smita Marwadi; Neelabh Sao;
Page : 247-255
Keywords : Web Crawler; Web Spider; concurrent crawler;
Abstract
The World Wide Web is expanding day by day. With the great growth of the Web, it has become a massive challenge for the all-purpose single process crawlers (A crawler is a program that downloads and stores Web pages, often for a Web search engine) to locate the resources that are precise and relevant in an appropriate amount of time, so more enhanced and convincing algorithms are in stipulate. Thus it becomes vital to improve the crawling procedure, in order to finish downloading pages in a sensible amount of time. Web crawler which employs multi-processing to allow multiple crawler processes to run concurrently. We have proposed a resourceful concurrent crawler that is fusion of page rank and concurrent multi-process crawler, offering a means to efficiently crawl the Web and presenting a scalable solution that allows crawl speeds to be tuned as needed.
Other Latest Articles
- The national existence motives in V. Pidpaly’s collection of poems ?Skovoroda’s thoughts?
- Specificity of romantic grotesque in E. T. A. Hoffman’s works “Little Zaches” and “The Golden Pot”
- Interrelation specificity of life-purpose and value orientations of the students with different features of self-attitude
- The essence of musical talent of the child
- Features of the checking of independent work of students system are in the humanitarian universities of France
Last modified: 2014-06-18 01:32:32