ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

An Assessment of the Visual Features Extractions for the Audio-Visual Speech Recognition

Journal: International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) (Vol.8, No. 5)

Publication Date:

Authors : ; ;

Page : 2023-2028

Keywords : Audio Visual Speech Recognition (AVSR); Motion Vector; Hidden Markov Model.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Utilization of the visual data from the speakers mouth region has appeared to develop presentation of the Automatic Speech-Recognition ASR frameworks. This is the particularly valuable in nearness of the clamor, which uniform in the moderate structure seriously debases discourse acknowledgment execution of frameworks utilizing just sound data. Different arrangements of highlights separated from speakers mouth area have been utilized to improve the showing of an ASR framework. In such testing situations and have met various triumphs, and to the best of creators information, the impact of utilizing these methods on the acknowledgment execution based on the phonemes have not been examined at this point. This paper presents examination of phoneme acknowledgement execution utilising visual highlights removed from mouth area of-enthusiasm utilising discrete cosine transform and discrete wavelet transform. Therefore, new discrete cosine transform and discrete wavelet transform feature have likewise been extricated and contrasted and the recently utilized one. These highlights were utilized alongside sound highlights dependent on the MelFrequency Cepstral Coefficients MFCCs. This recent research will help in the choosing appropriate feature for various application as well as distinguish the restrictions of these techniques in the acknowledgment of the individual-phonemes.

Last modified: 2019-11-11 17:52:37