Tunisian Dialect Recognition Based on Hybrid Techniques
Journal: The International Arab Journal of Information Technology (Vol. 15, No. 1)
Publication Date: 2018-01-01
Authors : Mohamed Hassine; Lotfi Boussaid; Hassani Massaoud
Page : 58-65
Keywords : Vector Quantization (VQLBG); Mel Frequency Cepstral Coefficients (MFCCs); Feed-Forward Back Propagation Neural Networks (FFBPNN); Speaker Dependent System;
Abstract
In this paper, an Arabic Automatic Speech Recognition system is implemented to recognize the ten Arabic digits (zero to nine) spoken in the Tunisian dialect (Darija). The system is divided into two main modules: a feature extraction module, which combines several conventional feature extraction techniques, and a recognition module, which uses Feed-Forward Back Propagation Neural Networks (FFBPNN). For this purpose, four dedicated speech corpora were prepared, each recorded by five speakers, and each speaker pronounced the ten digits five times. The chosen speakers differ in gender, age and physiological condition. The experiments focus on a speaker-dependent system, and the speaker-independent case is also examined. The recognition rate reaches up to 98.5% when the feature extraction phase uses the Perceptual Linear Prediction (PLP) technique, followed first by its first-order temporal derivative (∆PLP) and then by Linde-Buzo-Gray Vector Quantization (VQLBG).
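The feature pipeline named in the abstract (PLP, then ∆PLP, then VQLBG) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the PLP frames are already available (synthetic vectors stand in for them here), approximates the first-order derivative with a simple central difference, and runs a fixed number of refinement passes per LBG split. The names `delta` and `lbg`, the perturbation `eps`, and the iteration count are hypothetical choices made for illustration. The FFBPNN classifier is not shown.

```python
import random

def delta(frames):
    """First-order temporal derivative via a central difference.
    Assumption: the paper's exact regression window is not stated here."""
    n = len(frames)
    out = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]      # clamp at the first frame
        nxt = frames[min(t + 1, n - 1)]   # clamp at the last frame
        out.append([(b - a) / 2.0 for a, b in zip(prev, nxt)])
    return out

def _dist2(u, v):
    """Squared Euclidean distance between two vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))

def _mean(vectors):
    """Component-wise centroid of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def lbg(vectors, size, eps=0.01, iters=20):
    """Linde-Buzo-Gray codebook design by repeated splitting.
    Starts from the global centroid, splits every codeword into two
    perturbed copies, then refines with nearest-neighbour assignment
    and centroid updates. The codebook doubles on each split, so the
    result has the smallest power-of-two size that is >= `size`."""
    codebook = [_mean(vectors)]
    while len(codebook) < size:
        # Additive perturbation (a simplification; 1 +/- eps scaling is also common).
        codebook = [[c + s for c in cw] for cw in codebook for s in (eps, -eps)]
        for _ in range(iters):  # fixed pass count instead of a distortion threshold
            cells = [[] for _ in codebook]
            for v in vectors:
                k = min(range(len(codebook)), key=lambda i: _dist2(v, codebook[i]))
                cells[k].append(v)
            # An empty cell keeps its previous codeword.
            codebook = [_mean(cell) if cell else cw
                        for cell, cw in zip(cells, codebook)]
    return codebook

random.seed(0)
# Synthetic 3-dimensional frames standing in for PLP coefficients (hypothetical data).
plp = [[random.gauss(m, 0.1) for _ in range(3)]
       for m in (0.0, 1.0, 2.0, 3.0) for _ in range(10)]
# Append the derivative to each frame, then quantize the combined vectors.
features = [f + d for f, d in zip(plp, delta(plp))]
codebook = lbg(features, size=4)
```

In a recognizer of the kind described, the resulting codebook (or the quantized frames) would form the fixed-size input to the neural network. How the paper maps codewords to network inputs is not specified in the abstract.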