Automatic Distribution of Documents into Different Categories using Active Learning
Journal: International Journal of Science and Research (IJSR) (Vol.2, No. 2)Publication Date: 2013-02-05
Authors : M. Jayaprakash; D. John Aravindhar; E. R. Naganathan;
Page : 599-602
Keywords : Text categorization; Novel Active learning; Manifold learning; labeled data;
Abstract
Data mining extracts novel and useful knowledge from large repositories of data and has become an effective analysis and decision means in corporation in many information processing tasks, labels are usually expensive and the unlabeled data points are abundant. To reduce the cost on collecting labels, it is crucial to predict which unlabeled examples are the most informative, i. e. , improve the classifier the most if they were labeled. Many active learning techniques have been proposed for text categorization, such as SVM Active and Transductive Experimental Design. However, most of previous approaches try to discover the discriminant structure of the data space, whereas the geometrical structure is not well respected. By minimizing the expected error with respect to the optimal classifier, they can select the most representative and discriminative data points for labeling. Experimental results on text categorization have demonstrated the effectiveness of proposed approach.
Other Latest Articles
- Occurrence and Antibiogram of Escherichia Coli O157:H7 in Locally Fermented Milk (Nono) Sold Under Market Conditions in Nasarawa State, Nigeria
- Ultrasound Imaging in the Diagnosis and Assessment of Testicular Disease
- Impact of Training in Pharmaceutical Industry: An Assessment on Square Pharmaceuticals Limited, Bangladesh
- Thyroid Nodules Evaluation with Sonography
- An Intelligent Gateway Scheme for Power Aware Routing in Mobile Ad Hoc Network
Last modified: 2021-06-30 20:12:44