DocumentCode
3587724
Title
Histogram transform model using MFCC features for text-independent speaker identification
Author
Hong Yu ; Zhanyu Ma ; Minyue Li ; Jun Guo
Author_Institution
Pattern Recognition & Intell. Syst. Lab., Beijing Univ. of Posts & Telecommun., Beijing, China
fYear
2014
Firstpage
500
Lastpage
504
Abstract
A novel text-independent speaker identification (SI) method is proposed in this paper. This method uses the mel-frequency cepstral coefficients (MFCCs) and the dynamic information among adjacent frames as feature set to capture the speaker´s characteristics. In order to utilize dynamic information, we design super MFCCs feature by cascading 3 neighboring MFCCs frames together. The probability density function (PDF) of these super MFCCs features is estimated by the recently proposed histogram transform (HT) method, which generated more training data by random transforms to realize the histogram PDF estimation and recede the discontinuity problem of the common multivariate histograms computing. Compared to the conventional PDF estimation method, such as Gaussian mixture model, the HT model shows promising improvement in a SI task.
Keywords
cepstral analysis; probability; speaker recognition; transforms; HT method; common multivariate histogram computation; discontinuity problem; dynamic information; feature set; histogram PDF estimation; histogram transform model; mel-frequency cepstral coefficients; probability density function; random transforms; speaker characteristics; super MFCC feature design; super MFCC feature estimation; text-independent SI method; text-independent speaker identification; Feature extraction; Histograms; Mel frequency cepstral coefficient; Silicon; Speech; Training data; Transforms; Gaussian mixture model; Speaker identification; histogram transform model; mel-frequency cepstral coefficients;
fLanguage
English
Publisher
ieee
Conference_Titel
Signals, Systems and Computers, 2014 48th Asilomar Conference on
Print_ISBN
978-1-4799-8295-0
Type
conf
DOI
10.1109/ACSSC.2014.7094494
Filename
7094494
Link To Document