Document Classification Using Part of Speech in Text Mining

Journal: International Journal of Science and Research (IJSR) (Vol.4, No. 12)

Publication Date: 2015-12-05

Authors : Sonam Tripathi; Tripti Sharma;

Page : 2004-2008

Keywords : Text mining; Association rule; Sequential pattern mining; Closed pattern mining; Frequent pattern mining;


Abstract

Text mining is a practice used to find useful information in large data sets. Data mining offers techniques, namely frequent pattern mining and association rule mining, that are important for finding frequent patterns. Text mining is the discovery by computer of new, previously unknown information by automatically extracting it from different written resources. Text mining methods are the fundamental enabling tools for efficient organization, navigation, retrieval and summarization of large document collections. The problem is usually decomposed into two sub-problems. The first is to find the item sets whose occurrence in the database exceeds a predefined threshold; these are called frequent or large item sets. The second is to generate association rules from those large item sets subject to a minimum confidence constraint. In this work, text mining is done by dividing the given set of paragraphs into tokens and classifying them accordingly. The techniques consist of sequential pattern mining, closed pattern mining and frequent pattern mining. As a result, the patterns discovered in text mining cannot be reused, and not all frequent short patterns are useful here. In this work, an effective pattern taxonomy model combined with part of speech is proposed to overcome the problems of low frequency and misinterpretation.
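The two sub-problems described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the sample sentences, the thresholds, and the helper names `tokenize`, `frequent_itemsets` and `association_rules` are assumptions made for illustration, and the brute-force search is only practical for a tiny vocabulary.

```python
from itertools import combinations

def tokenize(sentence):
    """Split a sentence into a set of lowercase word tokens (hypothetical helper)."""
    return frozenset(w.strip(".,;:!?").lower() for w in sentence.split()) - {""}

def frequent_itemsets(transactions, min_support, max_size=3):
    """Sub-problem 1: item sets whose support meets the threshold (brute force)."""
    n = len(transactions)
    items = sorted(set().union(*transactions))
    frequent = {}
    for size in range(1, max_size + 1):
        for candidate in combinations(items, size):
            cand = frozenset(candidate)
            # Support = fraction of transactions containing every item.
            support = sum(cand <= t for t in transactions) / n
            if support >= min_support:
                frequent[cand] = support
    return frequent

def association_rules(frequent, min_confidence):
    """Sub-problem 2: rules A -> B from frequent item sets, filtered by confidence."""
    rules = []
    for itemset, support in frequent.items():
        for size in range(1, len(itemset)):
            for antecedent in combinations(sorted(itemset), size):
                a = frozenset(antecedent)
                # Every subset of a frequent item set is itself frequent,
                # so frequent[a] is always present.
                confidence = support / frequent[a]
                if confidence >= min_confidence:
                    rules.append((a, itemset - a, support, confidence))
    return rules

# Illustrative input only; each sentence is treated as one transaction.
sentences = [
    "Text mining finds patterns.",
    "Pattern mining finds rules.",
    "Text mining finds rules.",
    "Text classification uses tags.",
]
transactions = [tokenize(s) for s in sentences]
frequent = frequent_itemsets(transactions, min_support=0.5)
rules = association_rules(frequent, min_confidence=0.8)

for a, b, support, confidence in rules:
    print(sorted(a), "->", sorted(b), f"support={support:.2f} confidence={confidence:.2f}")
```

A practical system would replace the exhaustive enumeration with a pruning algorithm such as Apriori, and would attach part-of-speech tags to the tokens before mining. Both are left out here to keep the sketch free of external dependencies.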
