ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A Comparative Study on K-Means Clustering and Agglomerative Hierarchical Clustering

Journal: International Journal of Emerging Trends in Engineering Research (IJETER) (Vol.8, No. 5)

Publication Date:

Authors : ; ;

Page : 1600-1604

Keywords : agglomerative hierarchical clustering; centroids; dendrograms; k-means clustering;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Clustering is a well-established unsupervised data mining approach that group data points based on similarities. Clustering entities will give insights into the characteristics of different groups. Clustering results in minimization of the dimensionality of data set when you are dealing with a myriad number of data. The higher the homogeneity within the cluster and the higher the differences between the clusters, the finer the cluster will be. Clusters are mainly of two types: 1) Soft clustering: Based on the probability that a data point will belong to a specific cluster and, 2) Hard clustering: Data points are separated into independent clusters. Among hundreds of clustering algorithms, they can be labeled into one of following models such as connectivity, density, distribution and centroid model. This paper attempts to differentiate two widely used clustering techniques, k-means clustering and hierarchical clustering which belong to the centroid and connectivity models respectively. The comparison will be based on execution time and memory usage of both these algorithms when different sets of a delivery fleet driver data set are manipulated using these algorithms.

Last modified: 2020-06-15 16:01:21