ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Automatic Text Categorization on News Articles

Journal: International Journal of Engineering and Techniques (Vol.2, No. 3)

Publication Date:

Authors : ;

Page : 33-38

Keywords : Text categorization; Preprocessing; K-mean; TF-IDF etc;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Text categorization is a term that has intrigued researchers for quite some time now. It is the concept in which news articles are categorized into specific groups to cut down efforts put in manually categorizing news articles into particular groups. A growing number of statistical classification and machine learning technique have been applied to text categorization. This paper is based on the automatic text categorization of news articles based on clustering using k-mean algorithm. The goal of this paper is to automatically categorize news articles into groups. Our paper mostly concentrates on K-mean for clustering and for term frequency we are going to use TF-IDF dictionary is applied for categorization. This is done using mahaout as platform.

Last modified: 2018-05-18 19:10:50