DocumentCode
1490904
Title
Selecting Feature Frames for Automatic Speaker Recognition Using Mutual Information
Author
Jung, Chi-Sang ; Kim, Moo Young ; Kang, Hong-Goo
Author_Institution
Sch. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea
Volume
18
Issue
6
fYear
2010
Firstpage
1332
Lastpage
1340
Abstract
In this paper, an information-theoretic approach to selecting feature frames for speaker recognition systems is proposed. The conventional approach, in which the frame shift is fixed to around half of the frame length, may not be the best choice, because the characteristics of the speech signal may change rapidly, especially at phonetic boundaries. Experimental results show that the recognition accuracy increases if the frame interval is directly controlled using phonetic information. By applying these results to the well-known fact that the recognition accuracy is directly correlated with the amount of mutual information, this paper suggests a novel feature frame selection method for speaker recognition. Specifically, feature frames are chosen to have minimum redundancy among the selected feature frames, but maximum relevancy to speaker models. Experiments verify that the proposed method produces consistent improvement, especially in a speaker verification system. The method is also robust against variations in acoustic environment.
Keywords
speaker recognition; acoustic environment; automatic speaker recognition; feature frame selection method; frame length; frame shift; information theoretic approach; phonetic boundary; speaker verification system; speech signal; Feature frame selection; maximum-relevancy; minimum-redundancy; speaker recognition system
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1558-7916
Type
jour
DOI
10.1109/TASL.2009.2033631
Filename
5276841