LIP READING USING DEEP LEARNING TECHNIQUES
Journal: International Journal of Advanced Research in Engineering and Technology (IJARET) (Vol.11, No. 12)Publication Date: 2020-12-31
Authors : Rajneel Dutta Shridevi. S Thavavel Vaiyapuri;
Page : 2088-2099
Keywords : Deep Learning; Convolutional Neural Network; Image classification; Lip Reading.;
Abstract
Deep Learning in layman terms is basically stacking together multiple layers of neurons. These neurons lie on connected layers where the output from the previous layer is passed or connected to every other node in the next layer. In general, the performance of neural networks is optimized by stacking several layers together. In late programmed discourse acknowledgment considerations, profound learning engineering applications for acoustic include eclipsed regular sound highlights, for example, Mel-recurrence cepstral coefficient. Be that as it may, for visual discourse acknowledgment (VSR) studies, high-quality visual element extraction systems are still broadly used. Right now, this paper aims to apply convolutional neural system (CNN) as a visual component extraction instrument for VSR. Here, three different variants of CNN is developed to process the pictures of a speaker's mouth zone in blend with phoneme marks. Later, the developed CNN variants are used to remove visual highlights basic for perceiving phonemes and are evaluated for their performance with the most widely used benchmark dataset, MIRACL-VC1.
Other Latest Articles
- Surgery and Chronic Stress// Ultimately Leading to Major Health Risks
- Evaluation of Vitamin D3 level in Sino-nasal Polypi
- OPTICAL RECOGNITION OF OLD HANDWRITTEN MUSIC SYMBOLS BASED ON TEXTURE DESCRIPTORS
- Acute Renal Failure in Neonates with Perinatal Asphyxia and its Correlation with HIE Staging: A Prospective Case Control Study
- Surgical Aspects of Space Medicine
Last modified: 2021-02-24 18:08:12