A Supervised Method for Multi-keyword Web Crawling on Web Forums?
Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.3, No. 2)Publication Date: 2014-02-28
Authors : A.Gowtham Dr.K.Deepa;
Page : 374-381
Keywords : Web crawler; page classification; forum crawler; URL based learning;
Abstract
Web forums are used by large number of users to post and share their comments with other users of various websites. The forums consist of many lists of topics on their boards with a large list of threads in each board. The users can create many threads and share their views in posts as well. In this paper a supervised web forum multi-keyword crawler is proposed to crawl relevant contents from the forum pages by reducing the delay. All the forums in the web have navigation paths that lead to the forum threads and these paths are connected by specific types of URLs. Thus the proposed method needs to recognize the various URLs by using the regular expression patterns within the forum. Accurate page classifies trained by using other forums can be used to classify the regular expression patterns and detect the URLs. The obtained results show that the proposed method is more reliable and accurate comparing to other existing methods.
Other Latest Articles
- DETECTING NODE REPLICATION ATTACKS IN STATIC AND MOBILE SENSOR NETWORKS USING SPRT?
- Secure Token Based Storage System to Preserve the Sensitive Data Using Proxy Re-Encryption Technique
- EFFICIENT GRIDDING AND SEGMENTATION FOR MICROARRAY IMAGES?
- Monitoring Factory Machine Status from Remote Location using GSM Technologies?
- FACE RECOGNITION BASED ATTENDANCE MARKING SYSTEM?
Last modified: 2014-02-20 13:22:24