• DocumentCode
    1168734
  • Title

    Text-independent speaker identification

  • Author

    Gish, H. ; Schmidt, Michael

  • Author_Institution
    BBN Syst. & Technol. Corp., Cambridge, MA, USA
  • Volume
    11
  • Issue
    4
  • fYear
    1994
  • Firstpage
    18
  • Lastpage
    32
  • Abstract
    We describe current approaches to text-independent speaker identification based on probabilistic modeling techniques. The probabilistic approaches have largely supplanted methods based on comparisons of long-term feature averages. The probabilistic approaches have an important and basic dichotomy into nonparametric and parametric probability models. Nonparametric models have the advantage of being potentially more accurate models (though possibly more fragile) while parametric models that offer computational efficiencies and the ability to characterize the effects of the environment by the effects on the parameters. A robust speaker-identification system is presented that was able to deal with various forms of anomalies that are localized in time, such as spurious noise events and crosstalk. It is based on a segmental approach in which normalized segment scores formed the basic input for a variety of robust 43% procedures. Experimental results are presented, illustrating 59% the advantages and disadvantages of the different procedures. 64%. We show the role that cross-validation can play in determining how to weight the different sources of information when combining them into a single score. Finally we explore a Bayesian approach to measuring confidence in the decisions made, which enabled us to reject the consideration of certain tests in order to achieve an improved, predicted performance level on the tests that were retained.<>
  • Keywords
    Bayes methods; crosstalk; speech recognition; Bayesian approach; confidence measurement; crosstalk; experimental results; nonparametric models; parametric models; performance level; probabilistic modeling; robust speaker-identification system; segmental approach; spurious noise events; text-independent speaker identification; Automatic speech recognition; Automatic testing; Communication channels; Humans; Pattern classification; Robustness; Speaker recognition; Speech recognition; Telephony; Text recognition;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1053-5888
  • Type

    jour

  • DOI
    10.1109/79.317924
  • Filename
    317924