Specific Personal Alias Withdrawal from Web and Clustering of Similar Web Documents
Journal: International Journal of Science and Research (IJSR) (Vol.4, No. 11)Publication Date: 2015-11-05
Authors : Snehal S. Shinde; Prakash R. Devale;
Page : 2503-2506
Keywords : Web mining; ranking; clustering; web text analysis; co-occurrence frequency;
Abstract
There are many names available for a person, place or an entity on the web. If accurate alias of a particular individual is identified it becomes very useful in numerous web related tasks like information extraction, relation extraction, biomedical fields, sentiment analysis, personal name disambiguation, etc. Here, one method is projected based on referential ambiguity to find the correct alias for a given name. After accepting real name as input lexical patterns are achieved from the web. Candidate aliases are extracted with the help of these patterns. The candidate aliases are ranked using various ranking scores like co occurrence frequency, web dice, hub discounting, and degree distribution. This method improves the recall and attains a statistically considerable mean reciprocal rank. Using candidate aliases and data files, related web documents are bunched or grouped. Grouping achieves high accuracy and reduces the complexity.
Other Latest Articles
- Productivity and Carbon Balance of Soybean-Corn Intercrops are affected by the Density of Corn
- Antibacterial Behavior of Weed Plants from Aseer, Saudi Arabia and Their Possible Mode of Action
- Findings of Anisometropic Amblyopia in Children
- Study of Advances in Management of 68 Cases of Ectopic Gestation
- The Geology of Dumne Area in Southeastern Hawal Massif, Northeastern Nigeria
Last modified: 2021-07-01 14:26:37