A Comparative Study on K-Means Clustering and Agglomerative Hierarchical Clustering
Journal: International Journal of Emerging Trends in Engineering Research (IJETER) (Vol.8, No. 5)Publication Date: 2019-10-15
Authors : Karthikeyan B. Dipu Jo George G. Manikandan; Tony Thomas;
Page : 1600-1604
Keywords : agglomerative hierarchical clustering; centroids; dendrograms; k-means clustering;
Abstract
Clustering is a well-established unsupervised data mining approach that group data points based on similarities. Clustering entities will give insights into the characteristics of different groups. Clustering results in minimization of the dimensionality of data set when you are dealing with a myriad number of data. The higher the homogeneity within the cluster and the higher the differences between the clusters, the finer the cluster will be. Clusters are mainly of two types: 1) Soft clustering: Based on the probability that a data point will belong to a specific cluster and, 2) Hard clustering: Data points are separated into independent clusters. Among hundreds of clustering algorithms, they can be labeled into one of following models such as connectivity, density, distribution and centroid model. This paper attempts to differentiate two widely used clustering techniques, k-means clustering and hierarchical clustering which belong to the centroid and connectivity models respectively. The comparison will be based on execution time and memory usage of both these algorithms when different sets of a delivery fleet driver data set are manipulated using these algorithms.
Other Latest Articles
- The Role of Local and Regional Conflicts of the Influential Dynasties in lack of Development of Shushtar in Afshareyeh and Zandiyeh Periods
- Classification Performance for Credit Scoring using Neural Network
- Thermomechanical Solutions for Functionally Graded Beam subject to various Boundary Conditions
- Machine Learning-Structural Equation Modeling Algorithm: The Moderating role of Loyalty on Customer Retention towards Online Shopping
- A Real Time Linearization of NTC Thermistor using Hybrid Neuro-Fuzzy Logic based on VLSI Technology
Last modified: 2020-06-15 16:01:21