Keyword extraction from single documents using mean word intermediate distance
Journal: International Journal of Advanced Computer Research (IJACR) (Vol.6, No. 25)
Publication Date: 2016-07-31
Authors : Sifatullah Siddiqi; Aditi Sharan
Page : 138-145
Keywords : Keyword extraction; Word mean intermediate distance; Clustering; Standard deviation
Abstract
Keyword extraction is an important task in text mining. In this paper, a novel, unsupervised, domain-independent and language-independent approach for automatic keyword extraction from single documents is proposed. We use the word intermediate distance vector and its mean value to extract keywords. We compare our approach against the standard deviation of intermediate distances approach, taken as the baseline, and find heavy overlap between the results of the two approaches. Our approach has the advantage of being faster, especially for long documents, because it removes the need to compute the standard deviation of the word intermediate distance vector. Two famous works, “Origin of Species” and “A Brief History of Time”, are used to demonstrate the experimental results. Experiments show that the proposed approach works almost as well as the standard deviation approach, and the percentage overlap between the top 30 extracted keywords is more than 50%.
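The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The tokenizer, the `min_count` cutoff, the function names and the choice to rank in descending order are all assumptions made for illustration, since the abstract does not say how the mean is turned into a ranking.

```python
import re
from collections import defaultdict
from statistics import mean, pstdev


def intermediate_distances(text, min_count=3):
    """Map each word to the gaps between its successive occurrences.

    Assumption: a simple \\w+ tokenizer and a minimum occurrence count;
    neither is specified in the abstract.
    """
    tokens = re.findall(r"\w+", text.lower())
    positions = defaultdict(list)
    for index, token in enumerate(tokens):
        positions[token].append(index)
    return {
        word: [b - a for a, b in zip(pos, pos[1:])]
        for word, pos in positions.items()
        if len(pos) >= min_count
    }


def score_words(text, min_count=3):
    """Return {word: (mean gap, standard deviation of gaps)}."""
    gaps = intermediate_distances(text, min_count)
    return {word: (mean(g), pstdev(g)) for word, g in gaps.items()}


def top_k_overlap(scores, k=30, reverse=True):
    """Percentage of words shared by the top-k lists of the two statistics.

    Assumption: both lists are sorted in the same direction (`reverse`).
    """
    by_mean = sorted(scores, key=lambda w: scores[w][0], reverse=reverse)[:k]
    by_std = sorted(scores, key=lambda w: scores[w][1], reverse=reverse)[:k]
    if not by_mean:
        return 0.0
    return 100.0 * len(set(by_mean) & set(by_std)) / len(by_mean)
```

Because the gaps telescope, the mean gap of a word with n occurrences equals (last position - first position) / (n - 1), so it can be obtained without building the full vector. Whether the paper exploits this identity is not stated in the abstract.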