The Performance and Classifications of Audio-Visual Speech Recognition by Using the Dynamic Visual Features Extractions

Journal: International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) (Vol.8, No. 5)

Publication Date: 2019-10-15

Authors : Muhammad Ismail Mohmand Amiya Bhaumik Muhammad Humayun; Qayyum Shah;

Page : 2049-2053

Keywords : Automatic Speech Recognition ASR; Audio-Visual Speech Recognition AVSR; Region of Interest; Hidden Markov Model.;

Source : Download Find it from : Google Scholar

Abstract

Performance and classifications of the human speech recognition is bi-modular in nature and the expansion of visual data from the speaker's mouth area has been appeared to expand the presentation of the automatic speech recognition ASR frameworks. The actual performance and classifications of the audio visual speech recognitions break down quickly within the sight of even moderate commotion, however can be high quality by including visual data from the speaker mouth region. Therefore, the new methodology taken in this paper is to consolidate dynamic data caught from the speaker mouth happening during progressive casings of video got during expressed discourse. Furthermore, the audio only, visual only and audio visual recognizers were contemplated within the sight of commotion and demonstrate that the broad media recognizer has increasingly dynamic implementation.

Main Menu

Searching By

PARTNERS

The Performance and Classifications of Audio-Visual Speech Recognition by Using the Dynamic Visual Features Extractions

Abstract

Advertisement