PCFA: Mining of Projected Clusters in High Dimensional Data Using Modified FCM Algorithm
Journal: The International Arab Journal of Information Technology (Vol. 11, No. 2)
Publication Date: 2014-03-01
Authors : Ilango Murugappan; Mohan Vasudev;
Page : 168-177
Keywords : Clustering; FCM; Modified FCM; k-means clustering; accuracy; memory usage; computation time
Abstract
Data clustering deals with the specific problem of partitioning a group of objects into a fixed number of subsets, so that the similarity of objects within each subset is increased and the similarity across subsets is reduced. Several clustering algorithms have been proposed in the literature, of which k-means clustering and Fuzzy C-Means (FCM) clustering are the two most popular for partitioning numerical data into groups. Because of the drawbacks of both algorithms, however, recent research has paid more attention to modifying them. In this paper, we present an extensive analysis of modifying the FCM clustering algorithm to overcome the difficulties that the k-means and FCM algorithms face on high dimensional data. Accordingly, we propose an algorithm called Projected Clustering based on FCM Algorithm (PCFA). We first use the standard FCM clustering algorithm to sub-cluster the high dimensional data into reference centroids. The matrix containing these reference values is then fed as input to the modified FCM algorithm. Finally, experiments are carried out on very high dimensional datasets obtained from benchmark data repositories, and the performance of the PCFA algorithm is evaluated in terms of clustering accuracy, memory usage and computation time. The evaluation results show that the PCFA algorithm achieves approximately 20% improvement in execution time and 50% improvement in memory usage over the PCKA algorithm.
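For readers unfamiliar with the first stage mentioned in the abstract, the following is a minimal sketch of the standard FCM algorithm only, written with NumPy. It is not the authors' modified FCM or the full PCFA pipeline, neither of which is specified in the abstract. The function name `fuzzy_c_means`, the fuzzifier default `m=2.0`, the stopping tolerance and the random initialization are all assumptions made for illustration.

```python
import numpy as np

def fuzzy_c_means(X, c, m=2.0, max_iter=100, tol=1e-5, seed=0):
    """Textbook FCM sketch (illustrative; not the paper's modified FCM).

    X: (n_samples, n_features) array; c: number of clusters;
    m: fuzzifier (> 1). Returns (centroids, membership matrix U).
    """
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    # Random initial memberships; each row sums to 1.
    U = rng.random((n, c))
    U /= U.sum(axis=1, keepdims=True)
    for _ in range(max_iter):
        # Centroids are means of the data weighted by u_ij^m.
        W = U ** m
        centroids = (W.T @ X) / W.sum(axis=0)[:, None]
        # Euclidean distance from every point to every centroid.
        dist = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)  # guard against division by zero
        # Membership update: u_ij proportional to d_ij^(-2/(m-1)).
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centroids, U
```

Under these assumptions, a call such as `centroids, U = fuzzy_c_means(X, c=3)` returns a `(3, n_features)` centroid matrix, which plays the role of the "reference centroids" in the abstract's first stage, together with soft memberships whose rows sum to 1. Hard labels can be read off with `U.argmax(axis=1)`.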
Last modified: 2019-11-17 19:40:19