Document Classification Using Part of Speech in Text Mining
Journal: International Journal of Science and Research (IJSR) (Vol.4, No. 12)Publication Date: 2015-12-05
Authors : Sonam Tripathi; Tripti Sharma;
Page : 2004-2008
Keywords : Text mining; Association rule; Sequential pattern mining; Closed pattern mining; Frequent pattern mining;
Abstract
Text mining is a practice that is used to find beneficial in arrangement from the large amount of data sets. Data mining has guidelines called as frequent pattern and association rule that is important for finding frequent patterns. Text Mining is the detection by computer of new, previously unidentified in arrangement by automatically mining in arrangement from dissimilar written resources. Text mining methods are the fundamental and permitting tools for efficient organization, triangulation, retrieval and summarization of large document quantity. The problem is more often than not decomposed into two sub problems. The first is to find those kind of item sets whose occurrence goes beyond a predefined threshold set in the database, those item sets are describe frequent or large item sets. The second problem is to produce involvement rules from those huge item sets with the restriction of minimal self-confidence. In this work, the text mining is done by dividing the given set of paragraphs into tokens and classifying them accordingly. The techniques are purely composed of sequential pattern mining, closed pattern mining & frequent pattern mining. Hence, the discovered patterns in the field of text mining cannot be used further or again. All frequently used short patterns are not useful here. In this work, an effective pattern taxonomy model & part of speech have been proposed to overcome and solve the problem of low frequency & misinterpretation.
Other Latest Articles
- Notes on Occurrence of Fruticose Lichens in Joram Top, Ziro Valley, Arunachal Pradesh with 10 New Records to the State
- Outsourcing Data on Cloud Using Aggregate Key
- Design and Simulation of Parallel Manipulator for Vehicle Driving Simulator
- Lossless and Reversible Data Hiding in Asymmetric Cryptography
- An Energy Efficient Secure Acknowledgement based Authentication Protocol for WSN
Last modified: 2021-07-01 14:28:06