Efficient Way To Identify User Aware Rare Sequential Patterns In Document Streams
Journal: International Journal of Trend in Scientific Research and Development (Vol. 1, No. 4)
Publication Date: 2017-05-28
Authors : Swati V. Mengje; Rajeshri R. Shelke;
Page : 7-249
Keywords : Web mining; sequential patterns; document streams; rare events; pattern-growth; dynamic programming
Abstract
Documents created and distributed on the Internet are ever changing in various forms. Most existing work is devoted to topic modeling and the evolution of individual topics, while the sequential relations of topics in successive documents published by a specific user are ignored. In order to characterize and detect personalized and abnormal behaviours of Internet users, we propose Sequential Topic Patterns (STPs) and formulate the problem of mining User-aware Rare Sequential Topic Patterns (URSTPs) in document streams on the Internet. These patterns are rare on the whole but relatively frequent for specific users, so they can be applied in many real-life scenarios, such as real-time monitoring of abnormal user behaviours. We present a solution to this mining problem in three phases: pre-processing to extract probabilistic topics and identify sessions for different users, generating all the STP candidates with (expected) support values for each user by pattern-growth, and selecting URSTPs by performing user-aware rarity analysis on the derived STPs. Experiments on both real (Twitter) and synthetic datasets show that our approach can discover special users and interpretable URSTPs effectively and efficiently, and that the discovered patterns reflect users' characteristics.
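The idea of a pattern that is rare overall but frequent for one user can be illustrated with a minimal Python sketch. It is not the authors' implementation and rests on several simplifying assumptions: each document is given a single known topic label instead of a probabilistic topic distribution, so plain support replaces expected support; candidates are found by brute-force enumeration of subsequences instead of pattern-growth; global support is taken as the average of per-user supports; and the function names and the two thresholds (`min_user_support`, `max_global_support`) are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def subsequences(session, max_len):
    """Return the distinct ordered topic subsequences of length 1..max_len."""
    found = set()
    for n in range(1, max_len + 1):
        for idx in combinations(range(len(session)), n):
            found.add(tuple(session[i] for i in idx))
    return found


def user_supports(sessions, max_len=2):
    """Support of a pattern = fraction of the user's sessions containing it."""
    if not sessions:
        return {}
    counts = defaultdict(int)
    for session in sessions:
        for pattern in subsequences(session, max_len):
            counts[pattern] += 1
    return {p: c / len(sessions) for p, c in counts.items()}


def rare_user_patterns(sessions_by_user, max_len=2,
                       min_user_support=0.5, max_global_support=0.3):
    """Select patterns frequent for one user but rare across all users.

    Assumption: global support is the mean of the per-user supports,
    counting users who never show the pattern as support 0.
    """
    per_user = {u: user_supports(s, max_len)
                for u, s in sessions_by_user.items()}
    totals = defaultdict(float)
    for supports in per_user.values():
        for pattern, support in supports.items():
            totals[pattern] += support
    n_users = len(per_user)
    global_support = {p: t / n_users for p, t in totals.items()}

    result = {}
    for user, supports in per_user.items():
        selected = sorted(
            p for p, s in supports.items()
            if s >= min_user_support
            and global_support[p] <= max_global_support
        )
        if selected:
            result[user] = selected
    return result
```

Here `sessions_by_user` maps a user name to a list of sessions, and each session is a list of topic labels in publication order. Brute-force enumeration grows quickly with session length, which is one reason a pattern-growth strategy is preferable on real streams.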
Swati V. Mengje | Prof. Rajeshri R. Shelke "Efficient Way To Identify User Aware Rare Sequential Patterns In Document Streams" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-1 | Issue-4 , June 2017, URL: http://www.ijtsrd.com/papers/ijtsrd101.pdf
Last modified: 2017-05-28 21:28:06