• DocumentCode
    744154
  • Title

    Cochannel Speaker Identification in Anechoic and Reverberant Conditions

  • Author

    Xiaojia Zhao ; Yuxuan Wang ; DeLiang Wang

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
  • Volume
    23
  • Issue
    11
  • fYear
    2015
  • Firstpage
    1727
  • Lastpage
    1736
  • Abstract
    Speaker identification (SID) in cochannel speech, where two speakers are talking simultaneously over a single recording channel, is a challenging problem. Previous studies address this problem in the anechoic environment under the Gaussian mixture model (GMM) framework. On the other hand, cochannel SID in reverberant conditions has not been addressed. This paper studies cochannel SID in both anechoic and reverberant conditions. We first investigate GMM-based approaches and propose a combined system that integrates two cochannel SID methods. Second, we explore deep neural networks (DNNs) for cochannel SID and propose a DNN-based recognition system. Evaluation results demonstrate that our proposed systems significantly improve SID performance over recent approaches in both anechoic and reverberant conditions and various target-to-interferer ratios.
  • Keywords
    Gaussian processes; anechoic chambers (acoustic); mixture models; neural nets; reverberation chambers; speaker recognition; DNN; GMM framework; Gaussian mixture model; SID; anechoic conditions; anechoic environment; cochannel SID; cochannel SID methods; cochannel speaker identification; cochannel speech; deep neural networks; reverberant conditions; single recording channel; Gaussian mixture model; Hidden Markov models; IEEE transactions; Speech; Speech processing; Speech recognition; Cochannel speaker identification; Gaussian mixture model (GMM); deep neural network (DNN); reverberation; target-to-interferer ratio;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    2329-9290
  • Type

    jour

  • DOI
    10.1109/TASLP.2015.2447284
  • Filename
    7128346