A Framework For Aggregating And Retrieving Relevant Information Using TF-IDF And Term Proximity In Support Of Maize Production
Journal: International Journal of Scientific & Technology Research (Vol. 3, No. 3)
Publication Date: 2014-03-15
Authors : Philemon Kasyoka; Waweru Mwangi; Michael Kimwele;
Page : 205-209
Keywords : Inverse Document Frequency; Information Retrieval; RSS; Term Frequency; Term Proximity;
Abstract
This paper presents a framework for aggregating and retrieving relevant maize information using Term Frequency-Inverse Document Frequency (TF-IDF) and term proximity. The framework aggregates information from agricultural websites and blogs using RSS technology. TF-IDF can retrieve relevant documents from the aggregated RSS feeds; however, the presence of a query term in a retrieved document does not necessarily imply relevance, and documents with the same similarity score do not necessarily have the same level of relevance. To mitigate this problem, we implement a term proximity scoring approach that improves relevance among the top-k documents returned by TF-IDF. The term proximity score combines the span-based method and the pair-based method to ensure effective proximity scoring. The user preference profile is based on keywords, which form the user query, while the text documents are composed of the content of the RSS description and RSS title tags. Stemming is applied to query and document terms for better precision. The framework is intended to ensure that maize farmers get the most relevant information from online sources.
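The re-ranking idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the scoring formulas, so the TF-IDF weighting, the pair-based score (sum of inverse squared minimum distances between query-term pairs), the span-based score (matched terms divided by the shortest covering window), the additive combination, and the weight `alpha` are all assumptions chosen for illustration. Stemming and RSS parsing are omitted, and every function name here is hypothetical.

```python
import itertools
import math
from collections import Counter


def tokenize(text):
    # Lowercase and split on non-alphanumeric characters (no stemming here).
    return "".join(c.lower() if c.isalnum() else " " for c in text).split()


def tfidf_scores(query_terms, docs_tokens):
    # Assumed weighting: relative term frequency times log(N / df).
    n = len(docs_tokens)
    df = Counter(t for tokens in docs_tokens for t in set(tokens))
    scores = []
    for tokens in docs_tokens:
        tf = Counter(tokens)
        score = 0.0
        for q in query_terms:
            if tf[q]:
                score += (tf[q] / len(tokens)) * math.log(n / df[q])
        scores.append(score)
    return scores


def positions(tokens, query_terms):
    # Map each query term found in the document to its token positions.
    pos = {}
    for i, t in enumerate(tokens):
        if t in query_terms:
            pos.setdefault(t, []).append(i)
    return pos


def pair_score(pos):
    # Pair-based (assumed form): sum of 1/d^2 over pairs of distinct matched
    # terms, where d is the minimum distance between their occurrences.
    total = 0.0
    for a, b in itertools.combinations(sorted(pos), 2):
        d = min(abs(i - j) for i in pos[a] for j in pos[b])
        total += 1.0 / (d * d)
    return total


def span_score(pos):
    # Span-based (assumed form): number of matched terms divided by the
    # length of the shortest window containing all of them.
    if len(pos) < 2:
        return 0.0
    hits = sorted((i, t) for t, idxs in pos.items() for i in idxs)
    counts, have, left, best = Counter(), 0, 0, float("inf")
    for right_i, right_t in hits:
        counts[right_t] += 1
        if counts[right_t] == 1:
            have += 1
        while have == len(pos):
            best = min(best, right_i - hits[left][0] + 1)
            left_t = hits[left][1]
            counts[left_t] -= 1
            if counts[left_t] == 0:
                have -= 1
            left += 1
    return len(pos) / best


def rank(query, docs, k=10, alpha=1.0):
    # Take the top-k documents by TF-IDF, then re-rank them by
    # TF-IDF + alpha * (pair score + span score). alpha is hypothetical.
    q = set(tokenize(query))
    toks = [tokenize(d) for d in docs]
    base = tfidf_scores(q, toks)
    top = sorted(range(len(docs)), key=lambda i: base[i], reverse=True)[:k]

    def final(i):
        pos = positions(toks[i], q)
        return base[i] + alpha * (pair_score(pos) + span_score(pos))

    return sorted(top, key=final, reverse=True)


docs = [
    "maize needs rain and the soil also needs good fertilizer",
    "maize fertilizer should be applied early in the planting season",
    "beans grow well in cool highland areas with moderate rainfall totals",
]
print(rank("maize fertilizer", docs, k=3))
```

In this toy collection the first two documents receive identical TF-IDF scores for the query "maize fertilizer", since each contains both terms once in ten tokens. The proximity terms break the tie in favour of the second document, where the two query terms are adjacent.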
Last modified: 2015-06-28 03:53:46