The Performance and Classifications of Audio-Visual Speech Recognition by Using the Dynamic Visual Features Extractions
Journal: International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) (Vol.8, No. 5)Publication Date: 2019-10-15
Authors : Muhammad Ismail Mohmand Amiya Bhaumik Muhammad Humayun; Qayyum Shah;
Page : 2049-2053
Keywords : Automatic Speech Recognition ASR; Audio-Visual Speech Recognition AVSR; Region of Interest; Hidden Markov Model.;
Abstract
Performance and classifications of the human speech recognition is bi-modular in nature and the expansion of visual data from the speaker's mouth area has been appeared to expand the presentation of the automatic speech recognition ASR frameworks. The actual performance and classifications of the audio visual speech recognitions break down quickly within the sight of even moderate commotion, however can be high quality by including visual data from the speaker mouth region. Therefore, the new methodology taken in this paper is to consolidate dynamic data caught from the speaker mouth happening during progressive casings of video got during expressed discourse. Furthermore, the audio only, visual only and audio visual recognizers were contemplated within the sight of commotion and demonstrate that the broad media recognizer has increasingly dynamic implementation.
Other Latest Articles
- A Security Approach for File Management System using Data Encryption Standard (DES) algorithm
- Multi-domain Cross-dataset and Camera Style Transfer for Person Re-Identification
- Training and Development of Professional Skills: An Analysis of activity in Professional Skills
- An Assessment of the Visual Features Extractions for the Audio-Visual Speech Recognition
- Effective Zone Based Routing Protocols for MANET
Last modified: 2019-11-11 17:59:21