DocumentCode :
1020910
Title :
Speaker recognition using neural networks and conventional classifiers
Author :
Farrell, Kevin R. ; Mammone, Richard J. ; Assaleh, Khaled T.
Author_Institution :
CAIP Center, Rutgers Univ., Piscataway, NJ, USA
Volume :
2
Issue :
1
fYear :
1994
Firstpage :
194
Lastpage :
205
Abstract :
An evaluation of various classifiers for text-independent speaker recognition is presented. In addition, a new classifier is examined for this application. The new classifier is called the modified neural tree network (MNTN). The MNTN is a hierarchical classifier that combines the properties of decision trees and feedforward neural networks. The MNTN differs from the standard NTN in both the new learning rule used and the pruning criteria. The MNTN is evaluated for several speaker recognition experiments. These include closed- and open-set speaker identification and speaker verification. The database used is a subset of the TIMIT database consisting of 38 speakers from the same dialect region. The MNTN is compared with nearest neighbor classifiers, full-search, and tree-structured vector quantization (VQ) classifiers, multilayer perceptrons (MLPs), and decision trees. For closed-set speaker identification experiments, the full-search VQ classifier and MNTN demonstrate comparable performance. Both methods perform significantly better than the other classifiers for this task. The MNTN and full-search VQ classifiers are also compared for several speaker verification and open-set speaker-identification experiments. The MNTN is found to perform better than full-search VQ classifiers for both of these applications. In addition to matching or exceeding the performance of the VQ classifier for these applications, the MNTN also provides a logarithmic saving for retrieval.
Keywords :
decision theory; feedforward neural nets; speech recognition; trees (mathematics); vector quantisation; MNTN; TIMIT database; closed-set speaker identification; decision trees; feedforward neural networks; full-search VQ; hierarchical classifier; learning rule; modified neural tree network; multilayer perceptrons; nearest neighbor classifiers; neural networks; open-set speaker identification; pruning criteria; speaker recognition experiments; speaker verification; text-independent speaker recognition; tree-structured vector quantization; Classification tree analysis; Databases; Decision trees; Feature extraction; Multilayer perceptrons; Nearest neighbor searches; Neural networks; Signal processing; Speaker recognition; Vector quantization;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/89.260362
Filename :
260362
Link To Document :
بازگشت