A Bootstrap Aggregating Technique on Link-Based Cluster Ensemble Approach for Categorical Data Clustering
Journal: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY (Vol.10, No. 8)Publication Date: 2013-07-15
Authors : S Pavan Reddy; U Sesadri;
Page : 1913-1921
Keywords : BSA; Clustering; categorical data; cluster ensembles; link-based similarity; data mining.;
Abstract
Although attempts have been made to solve the problem of clustering categorical data via cluster ensembles, with the results being competitive to conventional algorithms, it is observed that these techniques unfortunately generate a final data partition based on incomplete information. The underlying ensemble-information matrix presents only cluster-data point relations, with many entries being left unknown. The paper presents an analysis that suggests this problem degrades the quality of the clustering result, and it presents a BSA (Bootstrap Aggregation) is a machine learning ensemble?meta-algorithm?designed to improve the stability and accuracy along with a new link-based approach, which improves the conventional matrix by discovering unknown entries through similarity between clusters in an ensemble. In particular, an efficient BSA and link-based algorithm is proposed for the underlying similarity assessment. Afterward, to obtain the final clustering result, a graph partitioning technique is applied to a weighted bipartite graph that is formulated from the refined matrix. Experimental results on multiple real data sets suggest that the proposed link-based method almost always outperforms both conventional clustering algorithms for categorical data and well-known cluster ensemble techniques.
Other Latest Articles
- Secure Data Forwarding in Cloud Storage System by using UMIB Proxy
- Investigation, Formulation and Development of an Open GUI for the Touchscreen Smartphone
- Trusted Cloud Platform for Cloud Infrastructure
- Effective Risk Management In Organizations:The Nigerian Experience
- Insurance as Strategy for Flood Risk Management at Limpopo River Basin ? A decision making Process under Uncertainty
Last modified: 2016-06-29 18:58:28