مرکز منطقه ای اطلاع رساني علوم و فناوري - Speaker recognition using neural networks and conventional classifiers

DocumentCode :

1020910

Title :

Speaker recognition using neural networks and conventional classifiers

Author :

Farrell, Kevin R. ; Mammone, Richard J. ; Assaleh, Khaled T.

Author_Institution :

CAIP Center, Rutgers Univ., Piscataway, NJ, USA

Volume :

Issue :

fYear :

1994

Firstpage :

194

Lastpage :

205

Abstract :

An evaluation of various classifiers for text-independent speaker recognition is presented. In addition, a new classifier is examined for this application. The new classifier is called the modified neural tree network (MNTN). The MNTN is a hierarchical classifier that combines the properties of decision trees and feedforward neural networks. The MNTN differs from the standard NTN in both the new learning rule used and the pruning criteria. The MNTN is evaluated for several speaker recognition experiments. These include closed- and open-set speaker identification and speaker verification. The database used is a subset of the TIMIT database consisting of 38 speakers from the same dialect region. The MNTN is compared with nearest neighbor classifiers, full-search, and tree-structured vector quantization (VQ) classifiers, multilayer perceptrons (MLPs), and decision trees. For closed-set speaker identification experiments, the full-search VQ classifier and MNTN demonstrate comparable performance. Both methods perform significantly better than the other classifiers for this task. The MNTN and full-search VQ classifiers are also compared for several speaker verification and open-set speaker-identification experiments. The MNTN is found to perform better than full-search VQ classifiers for both of these applications. In addition to matching or exceeding the performance of the VQ classifier for these applications, the MNTN also provides a logarithmic saving for retrieval.

Keywords :

decision theory; feedforward neural nets; speech recognition; trees (mathematics); vector quantisation; MNTN; TIMIT database; closed-set speaker identification; decision trees; feedforward neural networks; full-search VQ; hierarchical classifier; learning rule; modified neural tree network; multilayer perceptrons; nearest neighbor classifiers; neural networks; open-set speaker identification; pruning criteria; speaker recognition experiments; speaker verification; text-independent speaker recognition; tree-structured vector quantization; Classification tree analysis; Databases; Decision trees; Feature extraction; Multilayer perceptrons; Nearest neighbor searches; Neural networks; Signal processing; Speaker recognition; Vector quantization;

fLanguage :

English

Journal_Title :

Speech and Audio Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1063-6676

Type :

jour

DOI :

10.1109/89.260362

Filename :

260362

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1020910