Research on Large Scale Data Sets Categorization Based on SVM
Proceeding: The Fourth International Conference on Informatics & Applications (ICIA2015)Publication Date: 2015-07-20
Authors : Yongli Li; Liyan Dong; Minghui Sun; Hongjie Wang; Le Huang; Xinxin Wang; Meichen Dong;
Page : 51-61
Keywords : SVM; Large Data sets; Fuzzy Clustering; Category; Classification.;
Abstract
Support Vector Machines algorithms are not appropriate for the large data sets because of high training complexity. To address this issue, this paper presents a two stage SVM classification algorithm based on fuzzy clustering. The algorithm is divided into two phases. In the first phase, an approximate decision hyper-plane is obtained by weighted SVM which using the data after the fuzzy clustering as training data sets. In the second phase, the decision hyper-plane is obtained by SVM using the data near to the approximate hyper-plane obtained in the first phase. Experimental results demonstrate that our approach has good classification accuracy while the training is significantly faster than the standard SVM. The improved approach has a distinctive advantage on dealing with huge data sets.
Other Latest Articles
- An Approach for Signal Discrimination Based on Category-Preserving BoW
- Statistical Analysis Between Malware and Benign Based on IA-32 Instruction
- Strongly Unforgeable Revocable Certificateless Signature
- Measuring semantic similarity in WordNet by using Neural Network and Differential Evolution Algorithm
- Adaptive Memory Matrices for Automatic Termination of Evolutionary Algorithms
Last modified: 2015-08-10 22:21:09