Role of References in Similarity Estimation of Publications
Journal: The International Arab Journal of Information Technology (Vol.13, No. 3)Publication Date: 2016-05-01
Authors : Muhammad Shoaib; Ali Daud; Malik Khiyal;
Page : 1004-1011
Keywords : AND; references; vector space model; cosine similarity; citation matching.;
Abstract
Similarity estimation among publications is very important in classification and clustering techniques for grouping,indexing, citation matching and Author Name Disambiguation (AND) purposes. Publication attributes are basic sources of information and play important role in similarity estimation. Most of the works in AND use title, co-authors and venue attributes for estimating similarity among publications. Many other sources of information such as self-citations, shared citations and references, topic of the publications and abstracts have also been employed to estimate optimal similarity among publications. Recently, in the field of Academic Document Clustering (ADC), reference marker contexts have been utilized for this purpose. However, the use of citations and references is less common since only a few databases include this information. In this paper, we propose to use two components of references (co-authors and titles of references) as sources of information and investigate the importance of these components in similarity estimation. To the best of our knowledge, this is the first endeavour to exploit components of references as sources of information. Experiments conducted on real publication datasets reveal that these components of references are significant source of information for similarity estimation among publications.
Other Latest Articles
- A Novel Baseline Estimation Method for Arabic Handwritten Text Based on Exploited Components of Voronoi Diagrams
- An Approach for Clustering Class Coupling Metrics to Mine Object Oriented Software Components
- HF and DFT ANALYSIS OF STRUCTURE AND ENERGETICS OF Zn(H2O)n FOR n=1-10
- BIOSORPTION OF METHYL ORANGE DYE BY YELLOW MUSTARD SEEDS (Sinapis Alba L.)
- Information Security Risk Plans within Enterprise Architecture Framework
Last modified: 2019-11-14 18:09:09