• DocumentCode
    3787
  • Title

    Robust Speaker Identification in Noisy and Reverberant Conditions

  • Author

    Xiaojia Zhao ; Yuxuan Wang ; DeLiang Wang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • Volume
    22
  • Issue
    4
  • fYear
    2014
  • fDate
    Apr-14
  • Firstpage
    836
  • Lastpage
    845
  • Abstract
    Robustness of speaker recognition systems is crucial for real-world applications, which typically contain both additive noise and room reverberation. However, the combined effects of additive noise and convolutive reverberation have been rarely studied in speaker identification (SID). This paper addresses this issue in two phases. We first remove background noise through binary masking using a deep neural network classifier. Then we perform robust SID with speaker models trained in selected reverberant conditions, on the basis of bounded marginalization and direct masking. Evaluation results show that the proposed system substantially improves SID performance over related systems in a wide range of reverberation time and signal-to-noise ratios.
  • Keywords
    convolution; interference suppression; neural nets; reverberation; signal classification; speaker recognition; speech intelligibility; additive noise; background noise removal; binary masking; bounded marginalization; convolutive reverberation; deep neural network classifier; direct masking; noisy conditions; reverberant conditions; reverberation time; robust SID; room reverberation; signal-to-noise ratios; speaker identification; speaker model; Feature extraction; Noise; Noise measurement; Reverberation; Robustness; Speech; Training; Deep neural network; ideal binary mask; noise; reverberation; robust speaker identification;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2014.2308398
  • Filename
    6747994