Clustering Based on Correlation Fractal Dimension Over an Evolving Data Stream
Journal: The International Arab Journal of Information Technology (Vol.15, No. 1)Publication Date: 2018-01-01
Authors : Anuradha Yarlagadda Murthy Jonnalagedda; Krishna Munaga;
Page : 1-9
Keywords : Cluster; data stream; fractal; self-similarity; sliding window; damped window.;
Abstract
Online clustering, in an evolving high dimensional data is an amazing challenge for data mining applications.Although, many clustering strategies have been proposed, it is still an exciting task since the published algorithms fail to do well with high dimensional datasets, finding arbitrary shaped clusters and handling outliers. Knowing fractal characteristics of dataset can help abstract the dataset and provide insightful hints in the clustering process. This paper concentrates on presenting a novel strategy, FractStream for clustering data streams using fractal dimension, basic window technology, and damped window model. Core fractal-clusters, progressive fractal-cluster, outlier fractal clusters are identified, aiming to reduce search complexity and execution time. Pruning strategies are also employed based on the weights associated with each cluster, which reduced the usage of main memory. Experimental study of this paper over a number of data sets demonstrates the effectiveness and efficiency of the proposed technique.
Other Latest Articles
- The test of reincarnation of the soul by DNA and IRIS scanner (Part One)
- The test of reincarnation of the soul by DNA and IRIS scanner (Part Two)
- The test of reincarnation of the soul by DNA and IRIS scanner (Part Three)
- Design and Implementation of a Fine-grained Resource Usage Model for the Android Platform
- Improved Two-Factor Authenticated Key Exchange Protocol
Last modified: 2019-04-29 16:25:28