Spoken Language Identification using CNN with Log Mel Spectrogram Features in Indian Context
Journal: International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) (Vol.11, No. 6)Publication Date: 2022-12-10
Authors : Sreedhar Potla B. Vishnu Vardhan;
Page : 275-279
Keywords : Log Mel Spectrogram(LMS); Convolutional Neural Networks (CNN); IncepetionV3; Resnet50; Language identification (LID).;
Abstract
This study demonstrates a novel application of Log Mel Spectrogram coefficients to image classification via Convolutional Neural Networks (CNN). The acoustic features obtained as log mel spectrogram images are used in this article. Log mel spectrogram pictures, a novel technique, ensure that the system is noise-resistant and free of channel mismatch. The majority of Indian languages from our own dataset were used.With the use of auditory features integrated in CNN, we hope to quickly and accurately detect a language. InceptionV3 and Resnet50 models are also used in this study for performance analysis. When compared to the existing system, these approaches achieved significant improvements in language identification accuracy.
Other Latest Articles
- Simulation Based Analysis of Hierarchical Timed Colored Petri Nets Model of the Restaurant Food Serving Process
- On the Adoption of Emerging Technologies in Securing and Confidentializing the Healthcare Medical Data: Survey Study
- Automatic Change Detection on Satellite Images using Principal Component Analysis, ISODATA and Fuzzy C-Means Methods
- A Prefatory Approach Towards Monitoring and Detection of Fire Outbreak Using Artificial Neural Network
- Optimal Placement and Sizing of DGs in Distribution Networks Using Dandelion Optimization Algorithm: Case Study of an Algerian Distribution Network
Last modified: 2022-12-10 14:05:40