A New Algorithm For Document Classification Based On Weighting Features And Files

Journal: International Journal of Scientific Engineering and Technology (IJSET) (Vol. 5, No. 5)

Publication Date:

Authors : ;

Page : 296-300

Keywords : document classification; weighting features; retrieving documents

Abstract

As the volume of information continues to grow, so does the need for powerful new tools that turn data into useful knowledge. Text classification is one of the key ways of controlling and managing data. This article presents an algorithm for classifying documents. Its capabilities include quality control of the resulting classification based on feedback from the F-measure, weighting features according to the classes, assigning each file a weight in every class, and transferring each file to the class in which it has the greatest weight. Because the classes improve over the course of this procedure, redundant words are removed with high quality. We evaluate the algorithm in three steps. First, we study the influence of different initial random classifications. Next, we investigate how different weighting methods (TFCRF, TFRF, TFIDF, and the proposed weighting method) affect the output of the proposed classification algorithm. Finally, we compare the proposed algorithm with other algorithms. The results show that all of these factors together increase the quality and accuracy of the classification.
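
The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general loop it describes: weight the features per class, give each file a weight in every class, and move each file to the class where its weight is highest, repeating until nothing moves. The per-class weighting used here is a plain TF-IDF-style formula swapped in as an assumption; it is not the proposed weighting method, and the F-measure feedback step is omitted. The function names `class_feature_weights` and `reassign` are hypothetical.

```python
import math
from collections import Counter


def class_feature_weights(docs, labels, n_classes):
    """Weight each term within each class (assumed TF-IDF-style formula).

    Each class is treated as one pseudo-document: a term's weight is its
    relative frequency in the class times a smoothed log of how few
    classes contain it.
    """
    class_tf = [Counter() for _ in range(n_classes)]
    for tokens, label in zip(docs, labels):
        class_tf[label].update(tokens)

    # Number of classes in which each term occurs at least once.
    class_freq = Counter()
    for tf in class_tf:
        class_freq.update(tf.keys())

    weights = []
    for tf in class_tf:
        total = sum(tf.values()) or 1
        weights.append({
            term: (count / total) * math.log((1 + n_classes) / class_freq[term])
            for term, count in tf.items()
        })
    return weights


def reassign(docs, labels, n_classes, max_iter=20):
    """Move every file to its highest-weight class until assignments settle."""
    labels = list(labels)
    for _ in range(max_iter):
        weights = class_feature_weights(docs, labels, n_classes)
        new_labels = []
        for tokens in docs:
            # A file's weight in a class is the sum of its terms' class weights.
            scores = [sum(w.get(t, 0.0) for t in tokens) for w in weights]
            new_labels.append(max(range(n_classes), key=scores.__getitem__))
        if new_labels == labels:
            break
        labels = new_labels
    return labels


# Toy corpus (made up for illustration); the last file starts in the wrong class.
docs = [
    ["cat", "dog", "pet"],
    ["dog", "pet", "vet"],
    ["stock", "bond", "market"],
    ["bond", "market", "fund"],
]
final_labels = reassign(docs, [0, 0, 1, 0], n_classes=2)
```

In this toy run the misplaced fourth file moves to the class it shares vocabulary with. In an experiment like the one the abstract describes, the starting labels would be random and the weighting function would be replaced by TFCRF, TFRF, TFIDF, or the proposed method.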

Last modified: 2016-06-06 01:32:25