مرکز منطقه ای اطلاع رساني علوم و فناوري - Physiologically-motivated feature extraction for speaker identification

DocumentCode :

178047

Title :

Physiologically-motivated feature extraction for speaker identification

Author :

Jianglin Wang ; Johnson, Matthew Thomas

Author_Institution :

Dept. of Electr. & Comput. Eng., Marquette Univ., Milwaukee, WI, USA

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

1690

Lastpage :

1694

Abstract :

This paper introduces the use of three physiologically-motivated features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC), Glottal Flow Cepstrum Coefficients (GLFCC) and Teager Phase Cepstrum Coefficients (TPCC). These features capture speaker-discriminative characteristics from different aspects of glottal source excitation patterns. The proposed physiologically-driven features give better results with lower model complexities, and also provide complementary information that can improve overall system performance even for larger amounts of data. Results on speaker identification using the YOHO corpus demonstrate that these physiologically-driven features are both more accurate than and complementary to traditional mel-frequency cepstral coefficients (MFCC). In particular, the incorporation of the proposed glottal source features offers significant overall improvement to the robustness and accuracy of speaker identification tasks.

Keywords :

cepstral analysis; feature extraction; speaker recognition; GLFCC; RPCC; TPCC; YOHO corpus; glottal flow cepstrum coefficients; glottal source excitation patterns; model complexities; physiologically-motivated feature extraction; residual phase cepstrum coefficients; speaker identification; speaker-discriminative characteristics; teager phase cepstrum coefficients; Accuracy; Cepstrum; Feature extraction; Mel frequency cepstral coefficient; Speaker recognition; Speech; Speech processing; Glottal source excitation and GMM-UBM; Speaker distinctive feature; Speaker identification;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6853886

Filename :

6853886

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=178047