Comprehensive analysis of different clustering algorithm and their impact over time series data
Journal: International Journal of Computer Techniques (Vol.2, No. 1)Publication Date: 2015-01-01
Authors : Ibrahim K A Abughali; SonajhariaMinz;
Page : 158-169
Keywords : homogeneity score; separation score; time series data; financial array data; microarray data;
Abstract
Clustering is an approach to divide data into number of groups on the basis of some mutual characteristics each group called clusters, consists of objects that are similar between themselves and dissimilar to objects between other groups. Nowadays a lot of work is being carried out for trend analysis on time series data set. The genetic algorithm suffers with the problem in case of time series data because they consider each time stamp as a single entity during clustering. Even some algorithm still give good result on some type of time series data set but overall there is no generalized algorithm which considers different type of time series data set.This paper is presents comprehensive analysis over different type of clustering algorithms like k-means, hierarchical, SOM and GMM on two type of time series data, microarray and financial. These algorithms are compared on the following factors: size of data, number of clusters, type of data set, homogeneity score, separation score, silhouette coefficient etc, and presented extensive conclusion on the basis of the performance, quality and accuracy of the generic clustering algorithms.
Other Latest Articles
- Optimal Kinematic Synthesis of Planar Mechanism, Part I: Offset Crank-Slider Mechanism
- A Novel Classification Algorithm based on Contextual Information using FCM Classifier for Brain Tumor Diagnosis using MR Images
- IT Governance Ontology Building Process : Example of developing Audit Ontology
- DOMI: A Dynamic Ordering Multi-field Index for Data Processing
- Digital Image Steganography using a Novel Uniform Distortion Function with all Possible DCT Magnitudes
Last modified: 2015-07-09 17:02:44