ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Text Categorization using Jaccard Coefficient for Text Messages

Journal: International Journal of Science and Research (IJSR) (Vol.5, No. 5)

Publication Date:

Authors : ; ;

Page : 2046-2050

Keywords : Document Classification; Natural Language processing; Information retrieval; Text mining;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

There is wide growth in web application and electronic documents in day to day which needs automatic text classification of documents. Proper Classification methods provide the good results of the experiment and gives proper direction to the further processing of the text. The text is e-documents, news report, blogs, messages, comments on social media, e-books, web content etc which required text mining to extract meaningful knowledge from it. Some natural language techniques and machine learning algorithm are good to get the meaning of that e-document and classify them. There are lots of techniques are there for classification of the text documents, this paper is to understand different techniques and highlight the important methodology among them and helpful to selecting the classification technique which is appropriate to the text-classification process. And detail implementation of one of this method to classify the text message in two categories according the terms found in it. The coming text message is suspicious or not. In this case the Jaccard coefficient method gives the best result to classify message according to the words found in it. Text classification processes include several steps such as feature selection, vector representation and learning algorithm.

Last modified: 2021-07-01 14:37:34