Performance Analysis Of Similarity Coefficients In Web Information Retrieval Using Genetic Algorithm
Journal: International Journal of Scientific & Technology Research (Vol.3, No. 8)Publication Date: 2014-08-15
Authors : Vikas Thada; Vivek Jaglan;
Page : 9-14
Keywords : Index Terms Algorithm; Coefficients; Crawling; Focused; Genetic; Information; Ranking; Retrieval; Web.;
Abstract
Abstract Crawling is a process in which web search engines collect data from the web. Focused crawling is a special type of crawling process where crawler look for information related to a predefined topic1.In this paper a method for finding out the most relevant document among a set of documents for the given set of keyword is presented. Relevance checking is done with the help of Rogers-Tanimoto MountFord and Baroni-UrbaniBuser similarity coefficients. The method uses genetic algorithm to show that the average similarity of documents to the query increases when Probability of mutation is taken as low and Probability of crossover is taken as high. The method does the performance analysis of different similarity coefficients on the same set of documents and selects the best combination of ProC and ProM to achieve maximum relevancy using of Rogers-Tanimoto MountFord and Baroni-UrbaniBuser similarity coefficients.
Other Latest Articles
- Study To Evaluate The Scope And Nature Of WIT Wireless And Information Technology In Patient Health Monitoring Sector
- Crowdsourcing Social Media For Robot Learning A Review
- An Investigation Of The Physical And Mineralogical Characteristics Of The Quaternary Formation Of The Chad Basin Nigeria
- Determination Of The Presence Of Flavonoids In The Leaves Seeds And Branches Of The Matured Plant Indigofera Arrecta
- Role Of Third Party Agency In FPSOFSO EPCIC Project
Last modified: 2015-06-28 03:59:43