A CONTACTLESS SPEAKER IDENTIFICATION APPROACH USING FEATURE-LEVEL FUSION OF SPEECH AND FACE CUES WITH DCNN
Journal: Proceedings on Engineering Sciences (Vol.6, No. 3)Publication Date: 2024-09-30
Authors : Khushboo Jha Aruna Jain Sumit Srivastava;
Page : 1047-1056
Keywords : Deep Convolutional Neural Network; Data Fusion Methods; Dimensionality Reduction; Feature Extraction Techniques; Feature Level Fusion; Multimodal Biometric System;
Abstract
This paper evaluates the effectiveness of feature-level fusion through the concatenation method, of two independent and emerging modalities, speech and face. The major benefit of face modality (physiological) is that the data acquisition does not require much user cooperation or awareness, as seen in airports or public places in mass. Speech (physiological and behavioural) based recognition, for disabled and illiterate people, is the most convenient and reliable user identification technique due to the ease with which a contactless speech-receiving device can be accessed. Furthermore, it should be noted that adverse conditions, such as low illumination for facial recognition and a noisy environment for speech recognition during data acquisition, are not interdependent and function autonomously. Consequently, the acoustic and distinctive facial features are the paramount (fused) features in achieving higher user identification accuracy. This paper aims to explore the state-of-the-art techniques for data fusion, dimensionality reduction, feature extraction (speech-face) and classifier. Based on the above findings, we have proposed an efficient feature level fusion of speech and face cues with the deep convolutional neural network as a classifier for the VidTIMIT database. We have tested the effectiveness of the proposed approach in terms of identification accuracy with different training sample sizes and numbers of users. The proposed user identification approach achieves an accuracy of 97.31%, an EER of 3.62% and outperforms the unimodal biometric system for speech and face by 3.83% and 1.59 % respectively. Additionally, the proposed approach outperformed a few existing methodologies. Thus, we can infer that even in the presence of adverse conditions, such an approach can ameliorate the user identification-based solution.
Other Latest Articles
- UNCERTAINTY EVALUATION IN SHIP REPAIR SPECIFICATION
- THE BENEFITS OF FMEA IN IMPROVING THE INDUSTRIAL PROCESS OF A CABIN AIR CARRIER
- CONSTRUCTIONS IN INDIA FOR SUSTAINABLE BUILT ENVIRONMENT BASED ON COMPRESSED STABILIZED EARTH BLOCKS (CSEB) CASE STUDIES
- STORAGE AND PROCESS IMPROVEMENT IN MANUFACTURING SYSTEM
- COMPETENCY PROFILE OF PARENTS IN THE SYSTEM OF ASSESSMENT OF THE INCLUSIVE POTENTIAL OF A FAMILY
Last modified: 2024-09-02 03:34:41