Real World Document Clustering Using Modified Balanced Iterative Reducing and Clustering using Hierarchies
Journal: International Journal for Scientific Research and Development | IJSRD (Vol.3, No. 12)
Publication Date: 2016-03-01
Authors : Sunita N. Chaudhari; Praveen Kumar Gautam;
Page : 380-383
Keywords : Frequent Pattern Mining; High Utility Itemset Mining; Transaction Database;
Abstract
Clustering is "the process of organizing objects into groups whose members are similar in some way". A cluster is therefore a collection of objects that are internally coherent but clearly dissimilar to the objects belonging to other clusters. Document clustering is used in many fields, such as data mining and information retrieval. The aim of this work is to compare the clustering results of the K-means approach, the agglomerative approach, and the partitional approach for each of the criterion functions using real-world documents, and to establish the right clustering algorithm for producing high-quality clusterings of real-world documents. The goal of a document clustering method is to minimize intra-cluster distances between documents while maximizing inter-cluster distances (using an appropriate distance measure between documents). A distance measure (or, dually, a similarity measure) thus lies at the heart of document clustering. The large variety of documents makes it almost infeasible to create a general algorithm that works best for all kinds of datasets.
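As a rough illustration of the intra-cluster versus inter-cluster distance idea described in the abstract, the sketch below computes both averages for a toy labelled collection. It is not the authors' method: the choice of cosine distance over raw term-frequency vectors, the helper names (`tf_vector`, `cosine_distance`, `mean_intra_inter`), and the sample documents are all assumptions made for this example.

```python
import math
from collections import Counter
from itertools import combinations


def tf_vector(text):
    """Bag-of-words term-frequency vector (assumed representation)."""
    return Counter(text.lower().split())


def cosine_distance(a, b):
    """1 - cosine similarity between two sparse term-frequency vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # treat an empty document as maximally distant
    return 1.0 - dot / (norm_a * norm_b)


def mean_intra_inter(docs, labels):
    """Average pairwise distance within clusters and between clusters."""
    vecs = [tf_vector(d) for d in docs]
    intra, inter = [], []
    for i, j in combinations(range(len(docs)), 2):
        d = cosine_distance(vecs[i], vecs[j])
        (intra if labels[i] == labels[j] else inter).append(d)

    def mean(xs):
        return sum(xs) / len(xs) if xs else float("nan")

    return mean(intra), mean(inter)


# Hypothetical toy data: two topical clusters of two documents each.
docs = [
    "stock market prices fall",
    "market prices rise on stock news",
    "team wins football match",
    "football team loses match",
]
labels = [0, 0, 1, 1]

intra, inter = mean_intra_inter(docs, labels)
print(f"mean intra-cluster distance: {intra:.3f}")
print(f"mean inter-cluster distance: {inter:.3f}")
```

A clustering that matches the stated goal should yield a small intra-cluster average and a large inter-cluster average. Swapping in a different distance measure changes only `cosine_distance`, which is one way to see why the choice of measure is so central.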
Last modified: 2016-02-29 16:57:07