Text-independent speaker identification

Author

Gish, H. ; Schmidt, Michael

Author_Institution

BBN Syst. & Technol. Corp., Cambridge, MA, USA

Volume

11

Issue

4

fYear

1994

Firstpage

18

Lastpage

32

Abstract

We describe current approaches to text-independent speaker identification based on probabilistic modeling techniques. The probabilistic approaches have largely supplanted methods based on comparisons of long-term feature averages. The probabilistic approaches divide into two basic classes: nonparametric and parametric probability models. Nonparametric models have the advantage of being potentially more accurate (though possibly more fragile), while parametric models offer computational efficiencies and the ability to characterize the effects of the environment by its effects on the parameters. A robust speaker-identification system is presented that was able to deal with various forms of anomalies that are localized in time, such as spurious noise events and crosstalk. It is based on a segmental approach in which normalized segment scores formed the basic input for a variety of robust procedures. Experimental results are presented, illustrating the advantages and disadvantages of the different procedures. We show the role that cross-validation can play in determining how to weight the different sources of information when combining them into a single score. Finally, we explore a Bayesian approach to measuring confidence in the decisions made, which enabled us to reject the consideration of certain tests in order to achieve an improved, predicted performance level on the tests that were retained.

Keywords

Bayes methods; crosstalk; speech recognition; Bayesian approach; confidence measurement; experimental results; nonparametric models; parametric models; performance level; probabilistic modeling; robust speaker-identification system; segmental approach; spurious noise events; text-independent speaker identification; Automatic speech recognition; Automatic testing; Communication channels; Humans; Pattern classification; Robustness; Speaker recognition; Speech recognition; Telephony; Text recognition

fLanguage

English

Journal_Title

Signal Processing Magazine, IEEE

Publisher

IEEE

ISSN

1053-5888

Type

jour

DOI

10.1109/79.317924

Filename

317924