• DocumentCode
    2436047
  • Title

    Speaker Recognition in Adverse Conditions

  • Author

    Iyer, Ananth N. ; Ofoegbu, Uchechukwu O. ; Yantorno, Robert E. ; Wenndt, Stanley J.

  • Author_Institution
    Temple Univ., Philadelphia
  • fYear
    2007
  • fDate
    3-10 March 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Recognizing speakers from their voices is a challenging area of research with several practical applications. Presently speaker verification (SV) systems achieve a high level of accuracy under ideal conditions such as, when there is ample data to build speaker models and when speaker verification is performed in the presence of little or no interference. In general, these systems assume that the features extracted from the data follow a particular parametric probability density function (pdf), i.e., Gaussian or a mixture of Gaussians; where a form of the pdf is imposed on the speech data rather than determining the underlying structure of the pdf. In practical conditions, like in an aircraft cockpit where most of the verbal communication is in the form of short commands, it is almost impossible to ascertain that the assumptions made about the structure of the pdf are correct, and wrong assumptions could lead to significant reduction in performance of the SV system. In this research, non-parametric strategies, to statistically model speakers are developed and evaluated. Non-parametric density estimation methods are generally known to be superior when limited data is available for model building and SV. Experimental evaluation has shown that the non-parametric system yielded a 70% accuracy level in speaker verification with only 0.5 seconds of data and under the influence of noise with signal-to-noise ratio of 5dB. This result corresponds to a 20% decrease in error when compared to the parametric system.
  • Keywords
    feature extraction; probability; speaker recognition; aircraft cockpit; feature extraction; nonparametric density estimation methods; parametric probability density function; signal-to-noise ratio; speaker recognition; speaker verification; verbal communication; Aircraft; Buildings; Data mining; Feature extraction; Interference; Probability density function; Signal to noise ratio; Speaker recognition; Speech recognition; Yield estimation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Aerospace Conference, 2007 IEEE
  • Conference_Location
    Big Sky, MT
  • ISSN
    1095-323X
  • Print_ISBN
    1-4244-0524-6
  • Electronic_ISBN
    1095-323X
  • Type

    conf

  • DOI
    10.1109/AERO.2007.352976
  • Filename
    4161416