Improved Vector Space Model TF/IDF Using Lexical Relations
Journal: International Journal of Advanced Computer Research (IJACR) (Vol.5, No. 21)Publication Date: 2015-12-01
Authors : Minh Chau Huynh; Pham Duy Thanh Le; Trong Hai Duong;
Page : 334-346
Keywords : Sector space model; TF/IDF; Semantics; Information retrieval; Natural language processing.;
Abstract
Current vector space model, for instance TF/IDF, has not yet taken into account the relations between terms; it only combines the term frequency in a document and the inverse document frequency in whole database to identify importance-score (weight) of a term respect with the document. Here we discover lexical relations among terms in the document to improve the vector space model TF/IDF. The weight generated from TF/IDF for each term, which is improved by lexical relations among related terms in the document. We evaluate the proposed method using documents selected from Wikipedia. The result shown that the proposed method is significant effective.
Other Latest Articles
- Comparison of the Helicobacter Pylori Diagnosis Methods with Analytic Network Process
- Integrated Usability Testing
- Mode of getting Thermal Energy Depending of the Average Outside Temperature on the City of Bitola
- Influence of Intelletual Stimulation and Conflict Resolution on Project Implementation: A case of Constituency Development Fund Construction Projects in Public Secondary Schools in Kisumu County, Kenya
- Stiffness Analysis of the Sarafix External Fixator based on Stainless Steel and Composite Material
Last modified: 2015-11-30 15:54:49