ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A Cluster Based Approach for Classification of Web Results

Journal: International Journal of Advanced Computer Research (IJACR) (Vol.4, No. 17)

Publication Date:

Authors : ; ;

Page : 934-938

Keywords : Text mining; clustering; classification; IF-IDF.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Nowadays significant amount of information from web is present in the form of text, e.g., reviews, forum postings, blogs, news articles, email messages, web pages. It becomes difficult to classify documents in predefined categories as the number of document grows. Clustering is the classification of a data into clusters, so that the data in each cluster share some common trait ? often vicinity according to some defined measure. Underlying distribution of data set can somewhat be depicted based on the learned clusters under the guidance of initial data set. Thus, clusters of documents can be employed to train the classifier by using defined features of those clusters. One of the important issues is also to classify the text data from web into different clusters by mining the knowledge. Conforming to that, this paper presents a review on most of document clustering technique and cluster based classification techniques used so far. Also pre-processing on text dataset and document clustering method is explained in brief.

Last modified: 2015-03-05 18:59:22