DocumentCode :
1263916
Title :
Capture interspeaker information with a neural network for speaker identification
Author :
Wang, Lan ; Chen, Ke ; Chi, Huisheng
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Volume :
13
Issue :
2
fYear :
2002
fDate :
3/1/2002 12:00:00 AM
Firstpage :
436
Lastpage :
445
Abstract :
Model-based approach is one of methods widely used for speaker identification, where a statistical model is used to characterize a specific speaker´s voice but no interspeaker information is involved in its parameter estimation. It is observed that interspeaker information is very helpful in discriminating between different speakers. In this paper, we propose a novel method for the use of interspeaker information to improve performance of a model-based speaker identification system. A neural network is employed to capture the interspeaker information from the output space of those statistical models. In order to sufficiently utilize interspeaker information, a rival penalized encoding rule is proposed to design supervised learning pairs. For better generalization, moreover, a query-based learning algorithm is presented to actively select the input data of interest during training of the neural network. Comparative results on the KING speech corpus show that our method leads to a considerable improvement for a model-based speaker identification system
Keywords :
learning (artificial intelligence); neural nets; parameter estimation; speaker recognition; KING speech corpus; interspeaker information; parameter estimation; performance; query-based learning; rival penalized encoding scheme; speaker identification; statistical model; supervised learning; Encoding; Helium; Hidden Markov models; Indexing; Loudspeakers; Neural networks; Parameter estimation; Signal processing; Speech; Supervised learning;
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/72.991429
Filename :
991429
Link To Document :
بازگشت