The Automated VSMs to Categorize Arabic Text Data Sets

Journal: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY (Vol.13, No. 1)

Publication Date:

Authors : ; ;

Page : 4074-4081

Keywords : Arabic data sets; Data mining; Text categorisation; Term weighting; VSM

Abstract

Text categorization is one of the most important tasks in information retrieval and data mining. This paper investigates different variations of vector space models (VSMs) using the KNN algorithm. We used 242 Arabic abstract documents that were previously used by Hmeidi & Kanaan (1997). Our comparison is based on the most popular text evaluation measures: recall, precision, and F1. The experimental results on the Saudi data sets reveal that the Cosine coefficient outperformed the Dice and Jaccard coefficients.
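
To make the comparison in the abstract concrete, the following is a minimal Python sketch of the three similarity coefficients, a KNN vote, and the three evaluation measures. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not give the formulas, so the standard weighted-vector forms of Cosine, Dice, and Jaccard are assumed, and the function names, the toy English terms, and the value k=3 are all hypothetical.

```python
import math
from collections import Counter

# Documents are sparse term-weight vectors: {term: weight}.
# The formulas below are the standard weighted forms (an assumption;
# the abstract does not state which variants the paper used).

def dot(a, b):
    """Inner product of two sparse term-weight vectors."""
    return sum(w * b[t] for t, w in a.items() if t in b)

def cosine(a, b):
    denom = math.sqrt(dot(a, a)) * math.sqrt(dot(b, b))
    return dot(a, b) / denom if denom else 0.0

def dice(a, b):
    denom = dot(a, a) + dot(b, b)
    return 2 * dot(a, b) / denom if denom else 0.0

def jaccard(a, b):
    ab = dot(a, b)
    denom = dot(a, a) + dot(b, b) - ab
    return ab / denom if denom else 0.0

def knn_classify(query, training, similarity, k=3):
    """Label a query by majority vote among its k most similar documents.

    `training` is a list of (vector, label) pairs; k=3 is an arbitrary
    illustrative choice, not a value taken from the paper.
    """
    ranked = sorted(training, key=lambda item: similarity(query, item[0]),
                    reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

def precision_recall_f1(true_labels, predicted_labels, target):
    """Precision, recall, and F1 for a single category `target`."""
    pairs = list(zip(true_labels, predicted_labels))
    tp = sum(1 for t, p in pairs if t == target and p == target)
    fp = sum(1 for t, p in pairs if t != target and p == target)
    fn = sum(1 for t, p in pairs if t == target and p != target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Toy data with hypothetical terms and categories (not the paper's corpus).
training = [
    ({"bank": 2, "loan": 1}, "economy"),
    ({"bank": 1, "market": 2}, "economy"),
    ({"match": 2, "goal": 1}, "sport"),
    ({"team": 1, "goal": 2}, "sport"),
]
query = {"goal": 1, "team": 1}

for name, sim in [("cosine", cosine), ("dice", dice), ("jaccard", jaccard)]:
    print(name, knn_classify(query, training, sim, k=3))
```

Because `knn_classify` takes the similarity function as a parameter, the same classifier can be run once per coefficient and the resulting predictions scored with `precision_recall_f1`, which mirrors the kind of comparison the abstract describes.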

Last modified: 2016-06-29 17:56:55