ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

An overview of speaker recognition

Journal: Trends in Computer Science and Information Technology (Vol.4, No. 1)

Publication Date:

Authors : ; ;

Page : 001-012

Keywords : Speaker recognition; Feature extraction; MFCC; Deep learning; End-to-end model;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Speaker recognition has been studied for many years and has been a hot topic. This paper presents an overview of speaker recognition methods, which include the classical and the state-of-art methods. According to the modular components of speaker recognition system, we fi rstly introducedthe fundamentals of speaker recognition, which are mainly divided into two parts: feature extraction and speaker modeling. The most commonly speech features used in speaker recognition were elaborated fi rstly. In particular, the recent progress of deep neural network proposes a new approach of feature extraction and has become the technology trend. Secondly, the classical approaches of speaker recognition model were introduced, and elaborated the recent progress of deep learning speaker recognition. This paper especially provides an in-depth analysis on end-to-end model which consists of a training component to extract features, an enrollment component to training the speaker model, and an evaluation component with appropriate loss function for optimization. The fi nal part concludes the paper with discussion on future trends.

Last modified: 2019-10-03 19:24:47