HIDDEN WEB CRAWLERJournal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.9, No. 2)
Publication Date: 2020-02-28
Authors : Ashwini Bhardwaj; Shavita Shiwani; Vikas Verma; Harvir Singh;
Page : 137-146
Keywords : HIDDEN; WEB; CRAWLER; search engines;
Traditional search engines deal with the Surface Web which is a set of Web pages directly accessible through hyperlinks and ignores a large part of the Web called hidden Web which is a great amount of valuable information of online database which is “hidden” behind the query forms. To access to those information the crawler have to fill the forms with a valid data, for this reason we propose a new approach which use DIA and PSO technique in order to find the most promising keywords of a specific domain for automatic form submission. The effectiveness of proposed framework has been evaluated through experiments using real web sites and encouraging preliminary results were obtained.
Other Latest Articles
Last modified: 2020-04-19 23:48:59