ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Advancing Clustering Techniques Algorithms for Large scale Data Sets

Journal: International Journal of Trend in Scientific Research and Development (Vol.10, No. 1)

Publication Date:

Authors : ;

Page : 52-58

Keywords : Clustering; HDBSCAN; Large-scale Data; Mini-batch K-means; Spectral Clustering.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Clustering is a fundamental technique in data analysis, used to group similar data points based on their inherent patterns. As data grows in volume, complexity, and dimensionality, traditional clustering methods such as K means and DBSCAN face significant challenges in terms of scalability, computational efficiency, and handling noisy data. This article explores advanced clustering techniques specifically designed to address the challenges posed by large scale datasets. Key methods discussed include scalable variants of traditional algorithms e.g., Mini batch K means , density based techniques e.g., HDBSCAN , graph based clustering e.g., spectral clustering , matrix factorization methods e.g., Non negative Matrix Factorization , and deep learning based approaches e.g., autoencoders and deep clustering frameworks . The article also delves into the computational efficiency of these algorithms, emphasizing parallel and distributed computing, approximation techniques, and algorithmic comparisons. Additionally, real world applications of clustering in fields such as bioinformatics, social networks, market segmentation, and multimedia data are highlighted. The article concludes by examining future research directions, including real time clustering, integration with AI techniques, and opportunities for hardware and software advancements to support large scale clustering. The evolving landscape of clustering methods presents exciting opportunities for more efficient and insightful analysis of large, complex datasets. Dr. Gopal Prasad Sharma | Prof. Dr. Manish Pokharel | Prof. Dr. Pawan Kumar Jha | Prof. Raj Kumar Thakur "Advancing Clustering Techniques: Algorithms for Large-scale Data Sets" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-10 | Issue-1 , February 2026, URL: https://www.ijtsrd.com/papers/ijtsrd99952.pdf Paper URL: https://www.ijtsrd.com/computer-science/data-miining/99952/advancing-clustering-techniques-algorithms-for-largescale-data-sets/dr-gopal-prasad-sharma

Last modified: 2026-03-12 19:00:18