ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Efficient Text Clustering for Distributed Network

Journal: International Journal of Science and Research (IJSR) (Vol.3, No. 6)

Publication Date:

Authors : ; ;

Page : 362-365

Keywords : text clustering; k- means; p2p network; DHT; centroid;

Source : Downloadexternal Find it from : Google Scholarexternal


Text clustering is an important technique for improving the quality of information retrieval in both centralized and distributed environment. Most of the existing text clustering algorithms are designed for central execution, which are not work well on highly distributed environment. In this paper, an algorithm called probabilistic text clustering for distributed network such as peer to peer network is proposed. This algorithm achieves high scalability for assigning documents to clusters. It enables a peer to compare each of its documents only with very few selected clusters, maintain cluster quality.

Last modified: 2014-06-23 15:47:36