ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Clustering in big data analytics: a systematic review and comparative analysis (review article)

Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.23, No. 5)

Publication Date:

Authors : ;

Page : 967-979

Keywords : big data; clustering; data mining; empirical evaluations; performance metrics;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

In the modern world, the widespread use of information and communication technology has led to the accumulation of vast and diverse quantities of data, commonly known as Big Data. This necessitates the need for novel concepts and analytical techniques to help individuals extract meaningful insights from rapidly increasing volumes of digital data. Clustering is a fundamental approach used in data mining to retrieve valuable information. Although a wide range of clustering methods have been described and implemented in various fields, the sheer variety complicates the task of keeping up with the latest advancements in the field. This research aims to provide a comprehensive evaluation of the clustering algorithms developed for Big Data highlighting their various features. The study also conducts empirical evaluations on six large datasets, using several validity metrics and computing time to assess the performance of the clustering methods under consideration.

Last modified: 2023-10-24 18:32:08