URL Mining Using Agglomerative Clustering AlgorithmJournal: International Journal of Scientific & Technology Research (Vol.4, No. 2)
Publication Date: 2015-02-15
Authors : Chinmay R. Deshmukh; R .R. Shelke;
Page : 236-238
Keywords : Index Terms Agglomerative Clustering Algorithm; URL Mining; Re-ranking; Query Log analysis.;
Abstract The tremendous growth of the web world incorporates application of data mining techniques to the web logs. Data Mining and World Wide Web encompasses an important and active area of research. Web log mining is analysis of web log files with web pages sequences. Web mining is broadly classified as web content mining web usage mining and web structure mining. Web usage mining is a technique to discover usage patterns from Web data in order to understand and better serve the needs of Web-based applications. URL mining refers to a subclass of Web mining that helps us to investigate the details of a Uniform Resource Locator. URL mining can be advantageous in the fields of security and protection. The paper introduces a technique for mining a collection of user transactions with an Internet search engine to discover clusters of similar queries and similar URLs. The information we exploit is a clickthrough data each record consist of a users query to a search engine along with the URL which the user selected from among the candidates offered by search engine. By viewing this dataset as a bipartite graph with the vertices on one side corresponding to queries and on the other side to URLs one can apply an agglomerative clustering algorithm to the graphs vertices to identify related queries and URLs.
Other Latest Articles
Last modified: 2015-06-28 04:08:23