Automatic Speaker Recognition Based on Mel-Frequency Cepstral Coefficients and Gaussian Mixture Models
Journal: Mehran University Research Journal of Engineering and Technology (Vol. 32, No. 4)
Publication Date: 2012-10-01
Authors: Memon S.; Bhatti S.; Abro F.R.
Pages: 543-550
Keywords: Mel Frequency Cepstral Coefficients; Gaussian Mixture Models; Expectation Maximization; Speaker Recognition
Abstract
This paper investigates state-of-the-art techniques for the task of SR (Speaker Recognition). It first presents a technical description of automatic SR, followed by a comparative analysis of several methods available for feature extraction and modeling. Based on this analysis, the NIST 2001, NIST 2002, NIST 2004 and NIST 2006 speaker recognition corpora are used to investigate the state-of-the-art feature extraction and modeling techniques: delta MFCC (Mel Frequency Cepstral Coefficients) for feature extraction and GMM (Gaussian Mixture Models) trained with EM (Expectation Maximization) for modeling. The paper also presents the details of enrollment/training and recognition/testing, and summarizes the conventional methods used at the different stages of SR systems.
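As a rough illustration of the pipeline named in the abstract, the sketch below computes delta features from a sequence of cepstral frames and fits a diagonal-covariance GMM with EM. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`delta`, `train_gmm`, `avg_loglik`), the regression window `N=2`, the diagonal covariances, the variance floor, and the random initialization are all illustrative choices that the abstract does not specify. The MFCC frames themselves are assumed to be computed elsewhere.

```python
import math
import random


def delta(frames, N=2):
    """Regression-based delta features (assumed form):
    d_t = sum_{n=1..N} n * (c_{t+n} - c_{t-n}) / (2 * sum_{n=1..N} n^2),
    with edge frames repeated at the sequence boundaries."""
    T = len(frames)
    denom = 2 * sum(n * n for n in range(1, N + 1))
    out = []
    for t in range(T):
        row = []
        for k in range(len(frames[0])):
            acc = 0.0
            for n in range(1, N + 1):
                nxt = frames[min(T - 1, t + n)][k]
                prv = frames[max(0, t - n)][k]
                acc += n * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out


def log_gauss(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at x."""
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def logsumexp(vals):
    m = max(vals)
    return m + math.log(sum(math.exp(v - m) for v in vals))


def train_gmm(data, k, iters=20, seed=0, var_floor=1e-3):
    """Fit a k-component diagonal GMM with EM; returns (weights, means, variances)."""
    rng = random.Random(seed)
    n, dim = len(data), len(data[0])
    # Initialization (an assumption): random frames as means, global variance.
    means = [list(x) for x in rng.sample(data, k)]
    gmean = [sum(x[d] for x in data) / n for d in range(dim)]
    gvar = [max(var_floor, sum((x[d] - gmean[d]) ** 2 for x in data) / n)
            for d in range(dim)]
    variances = [list(gvar) for _ in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iters):
        # E-step: posterior responsibility of each component for each frame.
        resp = []
        for x in data:
            logs = [math.log(weights[j]) + log_gauss(x, means[j], variances[j])
                    for j in range(k)]
            norm = logsumexp(logs)
            resp.append([math.exp(l - norm) for l in logs])
        # M-step: re-estimate weights, means, and floored variances.
        for j in range(k):
            nj = max(sum(r[j] for r in resp), 1e-10)
            weights[j] = nj / n
            means[j] = [sum(r[j] * x[d] for r, x in zip(resp, data)) / nj
                        for d in range(dim)]
            variances[j] = [
                max(var_floor,
                    sum(r[j] * (x[d] - means[j][d]) ** 2
                        for r, x in zip(resp, data)) / nj)
                for d in range(dim)]
    return weights, means, variances


def avg_loglik(data, model):
    """Average per-frame log-likelihood of data under a trained GMM."""
    weights, means, variances = model
    total = 0.0
    for x in data:
        total += logsumexp([math.log(w) + log_gauss(x, m, v)
                            for w, m, v in zip(weights, means, variances)])
    return total / len(data)
```

In this sketch, enrollment would correspond to calling `train_gmm` once per speaker on that speaker's feature frames, and recognition to scoring a test utterance with `avg_loglik` against each speaker model and choosing the highest score.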