Text Document Annotation and Retrieval Based on Content of the Document and Query Workload
Journal: International Journal of Science and Research (IJSR) (Vol.5, No. 5)Publication Date: 2016-05-05
Authors : Arunima P V; Ravinarayana B;
Page : 2398-2403
Keywords : Attribute-Value pair Annotation; Annotation Generation; Result Ranking; Query Workload;
Abstract
Performing search and retrieval in large collection of textual data is a complex task. For effective searching of text documents, annotations are used. Annotations are the tags, s, attribute-value pairs, comments or summary that are attached to a document or a part of the document. Annotations can be considered as a structured representation of unstructured data. Since manually annotating each document in a large collection is not feasible, automatic annotation techniques are used. In this work, an automatic annotation generation technique based on the content of the document and query workload is introduced. Annotations are generated in the form of attribute-value pairs as they are more expressive than simple annotations. The system generates both attributes and values for a document by analyzing the content of the document, annotations of the existing documents and query workload. These annotations are later used during the search and retrieval process for matching with the queries given by the user. As an enhancement to the system, a new ranking method for ranking the retrieved documents is also introduced.
Other Latest Articles
- Ground State Energies of a Sextic Anharmonic Oscillator Including Quartic Anharmonicity
- The Impact of the Internal Variables on Water Security in the Middle East (Water is a Foundation for Human Prosperity)
- Climatic Elements & Their Impact on Building Design
- Impact of Nutrition Education on Nutritional Status and Daily Dietary Pattern of College Going Girls
- Assessment of Ground Water Quality: Selected Villages of Mahabubnagar Mandal & District, Telangana State (India)
Last modified: 2021-07-01 14:37:34