ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

The Performance and Classifications of Audio-Visual Speech Recognition by Using the Dynamic Visual Features Extractions

Journal: International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) (Vol.8, No. 5)

Publication Date:

Authors : ; ;

Page : 2049-2053

Keywords : Automatic Speech Recognition ASR; Audio-Visual Speech Recognition AVSR; Region of Interest; Hidden Markov Model.;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Performance and classifications of the human speech recognition is bi-modular in nature and the expansion of visual data from the speaker's mouth area has been appeared to expand the presentation of the automatic speech recognition ASR frameworks. The actual performance and classifications of the audio visual speech recognitions break down quickly within the sight of even moderate commotion, however can be high quality by including visual data from the speaker mouth region. Therefore, the new methodology taken in this paper is to consolidate dynamic data caught from the speaker mouth happening during progressive casings of video got during expressed discourse. Furthermore, the audio only, visual only and audio visual recognizers were contemplated within the sight of commotion and demonstrate that the broad media recognizer has increasingly dynamic implementation.

Last modified: 2019-11-11 17:59:21