FoCUS ? Forum Crawler Under Supervision?
Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.3, No. 8)Publication Date: 2014-08-30
Authors : V.Rajapriya;
Page : 79-84
Keywords : Page classification; URL pattern learning; Sentimental analysis;
Abstract
Forum Crawler Under Supervision (FoCUS) is a supervised web-scale forum crawler. The web contains large data and innumerable websites that are monitored by a tool or program known as crawler. The goal is to crawl relevant forum content from the web with minimal overhead. Forums have different layouts or styles and are powered by different forum software packages. They have similar implicit navigation paths connected by specific URL types to lead users from entry pages to thread pages. It reduces the web forum crawling problem to a URL-type recognition problem. It also shows how to learn accurate and effective regular expression patterns of implicit navigation paths from automatically created training sets using aggregated results from weak page type classifiers. These type classifiers can be trained and applied to large set of unseen forums. It produces the best effectiveness and addresses the scalability issue and includes the concept called sentimental analysis.
Other Latest Articles
- Automated Window 8 Security and Safety System?
- THE PRACTICE OF FRENCH JUSTICE ARTICLE 228 OF THE UN CONVENTION ON THE LAW OF THE SEA
- IMPLEMENTATION OF THE PUBLIC-PRIVATE PARTNERSHIP IN THE FIELD OF PRODUCTION AND CONSUMPTION WASTE IN CONSTITUENT AND MUNICIPAL ENTITIES OF THE RUSSIAN FEDERATION
- LINGUISTIC IMAGE OF AN IRISHMAN IN THE CONTEXT OF HISTORIC REFLECTION
- THE PECULIAR FEATURES OF THE CONCEPT TEUFEL REPRESENTATION IN PROVERBS AND SAYINGS IN GERMAN LANGUAGE
Last modified: 2014-08-09 01:26:56