DocumentCode
1168734
Title
Text-independent speaker identification
Author
Gish, H. ; Schmidt, Michael
Author_Institution
BBN Syst. & Technol. Corp., Cambridge, MA, USA
Volume
11
Issue
4
fYear
1994
Firstpage
18
Lastpage
32
Abstract
We describe current approaches to text-independent speaker identification based on probabilistic modeling techniques. The probabilistic approaches have largely supplanted methods based on comparisons of long-term feature averages. The probabilistic approaches have an important and basic dichotomy into nonparametric and parametric probability models. Nonparametric models have the advantage of being potentially more accurate models (though possibly more fragile) while parametric models that offer computational efficiencies and the ability to characterize the effects of the environment by the effects on the parameters. A robust speaker-identification system is presented that was able to deal with various forms of anomalies that are localized in time, such as spurious noise events and crosstalk. It is based on a segmental approach in which normalized segment scores formed the basic input for a variety of robust 43% procedures. Experimental results are presented, illustrating 59% the advantages and disadvantages of the different procedures. 64%. We show the role that cross-validation can play in determining how to weight the different sources of information when combining them into a single score. Finally we explore a Bayesian approach to measuring confidence in the decisions made, which enabled us to reject the consideration of certain tests in order to achieve an improved, predicted performance level on the tests that were retained.<>
Keywords
Bayes methods; crosstalk; speech recognition; Bayesian approach; confidence measurement; crosstalk; experimental results; nonparametric models; parametric models; performance level; probabilistic modeling; robust speaker-identification system; segmental approach; spurious noise events; text-independent speaker identification; Automatic speech recognition; Automatic testing; Communication channels; Humans; Pattern classification; Robustness; Speaker recognition; Speech recognition; Telephony; Text recognition;
fLanguage
English
Journal_Title
Signal Processing Magazine, IEEE
Publisher
ieee
ISSN
1053-5888
Type
jour
DOI
10.1109/79.317924
Filename
317924
Link To Document