ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

CLUSTERING BASED DOCUMENT SUMMARIZATION

Journal: International Journal of Emerging Trends & Technology in Computer Science (IJETTCS) (Vol.5, No. 6)

Publication Date:

Authors : ; ;

Page : 80-85

Keywords : Document Summarization; Sentence Preprocessing; Natural Language processing (nlp); Clustering; Word Net.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Abstract: Document summarization involves summarizing document as the information is continuously increasing with such a huge amount. Users do not have much time to spend reading thousands of lines. Today users want maximum information which describes everything and occupies minimum space. This paper discusses an improved approach for document summarization by using clustering. Summarization is process of producing single summaries from a document. The three major problems that were introduced in single document summarization were coped in k means clustering summarization i.e. coping with redundancy, coherency in summary, identifying difference in sentences. To identify similarity in documents various similarity measures are used i.e. similarity between the sentences of documents and then grouping them in clusters based on their tf*idf values of the words.

Last modified: 2017-01-07 14:04:34