Recognition of Spoken Bengali Numerals Using MLP, SVM, RF Based Models with PCA Based Feature Summarization
Journal: The International Arab Journal of Information Technology (Vol.15, No. 2)Publication Date: 2018-03-01
Authors : Avisek Gupta; Kamal Sarkar;
Page : 263-269
Keywords : Speech recognition; isolated digits; principal component analysis; support vector machines; multi-layered perceptrons; random forests.;
Abstract
This paper presents a method of automatic recognition of Bengali numerals spoken in noise-free and noisy environments by multiple speakers with different dialects. Mel Frequency Cepstral Coefficients (MFCC) are used for feature extraction, and Principal Component Analysis is used as a feature summarizer to form the feature vector from the MFCC data for each digit utterance. Finally, we use Support Vector Machines, Multi-Layer Perceptrons, and Random Forests to recognize the Bengali digits and compare their performance. In our approach, we treat each digit utterance as a single indivisible entity, and we attempt to recognize it using features of the digit utterance as a whole. This approach can therefore be easily applied to spoken digit recognition tasks for other languages as well
Other Latest Articles
- Comparison of Dimension Reduction Techniques on High Dimensional Datasets
- Pair Programming for Software Engineering Education: An Empirical Study
- Hybrid Algorithm with Variants for Feed Forward Neural Network
- Revisiting Constraint Based Geo Location: Improving Accuracy through Removal of Outliers
- Fuzzy Logic based Decision Support System for Component Security Evaluation
Last modified: 2019-04-29 20:47:22